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support  and  understanding  during  the  past  eighteen  months. 
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Abstract 
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This  thesis  compares  two  modified  maximum  likelihood 
(ML)  estimation  techniques  against  three  minimum  distance 
(MD)  estimation  techniques  in  application  to  the  three  para¬ 
meter  lognormal  distribution.  The  three  parameter  lognormal 

X  » 

distribution  has  a  location  parameter  (t)  and  two  other 

(Y  r  j  r\ 

parameters  associated  with  the  mean  ( ii )  and  standard  devia¬ 
nt  I'v 

tion  (cf)  of  its  parent  normal  population.  The  first  modified 
ML  technique  uses  linear  interpolation  on  order  statistics 
to  estimate  location  while  the  second  ML  technique  uses  the 
first  order  statistic  as  the  location  estimate.  The  remaining 
two  parameters  are  calculated  by  using  the  location  estimate 
in  their  respective  censored  or  uncensored  ML  equations  and 
solving  for  the  parameters.  Three  MD  techniques  are  used: 
Kolmogrov  Distance,  Cramer-von  Mises  Statistic,  and  the 
Anderson-Darling  Statistic.  The  MD  techniques  refine  the 
location  estimates  which  are  then  used* in  the  ML  equations 
of  the  other  two  parameters  to  obtain  their  refined  estimates. 

Monte  Carlo  analysis  is  used  to  accomplish  the 
comparison  of  estimation  techniques.  Sample  sizes  of  6,  8, 

10,  12,  and  16  are  generated  using  tft^ree  parameter  sets 
(V,a,£)  -1  (0.0,  1.0,  10.0),  (1.0,  1.0,\10.0),  and  (1.0,  2.0, 

\ 

vii 
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10.0)  Eadh  estimation  technique  is  applied  to  one-thousand 
replications  for  every  combination  of  sample  size  and  para- 
meter  set.  Three  measures  of  effectiveness  are  used  to 
facilitate  comparisons:  mean  square  error,  relative 
efficiency,  and  the  Cramer-von  Mises  Statistic.  Comparisons 
of  these  effectiveness  measures  across  all  cases  reveal  a 
clear  superiority  of  the  MD  techniques  over  the  modified  ML 
techniques. 

At 
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ROBUST  MINIMUM  DISTANCE  ESTIMATION  OP 
THE  THREE  PARAMETER  LOGNORMAL 
DISTRIBUTION 

I .  Introduction 

Statistical  analysis  is  continuousl  flaying  an  ever- 
increasing  role  in  all  branches  of  the  pret  day  military 
services.  Being  of  such  diverse  application,  statistical 
analysis  is  being  used  for  optimally  operating  under  con¬ 
strained  budgets,  determining  manpower  and  equipment  require¬ 
ments,  and  increasing  the  level  of  operational  readiness  and 
availability.  Procurement  of  multi-million  dollar  weapon 
systems,  desired  increases  in  the  prediction  of  system  and 
component  reliabilities,  and  logistics  planning  all  usually 
involve  statistical  analysis  before  the  decision-making 
process  begins.  The  application  of  more  efficient  and 
accurate  statistics  to  these  and  other  areas  leads  to  better 
informed  decision  makers  and  more  effective  decisions  being 
made,  thereby  strengthening  our  military  forces  while 
simultaneously  operating  under  the  imposed  constraints. 

One  very  important  aspect  of  statistical  analysis  is 
that  of  parameter  estimation  of  distributions.  Often  times 
an  analyst  can  look  at  a  histogram  of  some  data  and  determine 
which  density  function  belongs  to  the  data  set.  However,  in 
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most  cases,  simply  identifying  the  data's  density  function 
will  not  yield  vital  information  such  as  the  mean,  variance, 
and  standard  deviation.  If  an  analyst  had  some  reliability 
data  on  a  piece  of  operational  equipment  and  was  asked  to 
determine  some  statistics  such  as  the  mean  time  between 
failures  (MTBF)  or  the  mean  time  to  repair  (MTTR) ,  simply 
fitting  a  density  function  by  graphical  methods  and  estimating 
its  parameters  would  most  surely  yield  inaccurate  estimates 
at  best.  The  military  services  recognize  the  importance  of 
accurately  determining  these  parameter  estimates  and  have 
devoted  much  time  and  money  to  pursue  this  end.  For  example, 
a  commander  of  an  F-16  squadron  would  certainly  like  to  know, 
as  accurately  as  possible,  how  often  the  aircraft  breaks  down 
and  how  much  time  is  needed  to  repair  or  replace  the  faulty 
component (s) .  These  and  other  factors  bear  directly  on  the 
survivability  and  operational  capability  of  the  squadron,  not 
to  mention  national  defense.  In  order  to  obtain  these  vital 
statistics,  the  parameters  of  an  assumed  distribution  must 
be  estimated  from  sample  data.  Of  course,  better  estimation 
procedures  result  in  more  accurate  parameter  estimates  and 
yield  more  accurate  information  about  the  data.  With  the 
importance  of  statistical  analysis  and  parameter  estimation 
in  mind,  this  thesis  effort  focuses  on  parameter  estimation 
as  applied  to  the  three  parameter  lognormal  (3-LN)  distribu¬ 
tion. 
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The  method  of  moments  and  maximum  likelihood  are 
two  classical  parameter  estimation  techniques  with  the  method 
of  maximum  likelihood  usually  outperforming  the  method  of 
moments.  However,  a  recent  technique  called  the  minimum 
distance  method  developed  by  J.  Wolfowitz  in  the  1950s,  seems 
to  yield  better  parameter  estimates  than  those  from  the  method 
of  maximum  likelihood  (3;  18;  25).  This  thesis  applies 
both  the  minimum  distance  method  ana  the  method  of  maximum 
likelihood  to  the  three  parameter  lognormal  distribution  and 
compares  the  estimates  from  both  methods.  A  Monte  Carlo 
analysis  conducts  the  investigation.  Both  methods  are  dis¬ 
cussed  extensively  in  Chapters  IV  and  V  respectively. 

« 

Dr  Albert  H.  Moore,  professor  of  statistics  at  the 
Air  Force  Institute  of  Technology,  and  his  past  thesis 
students  have  devoted  much  time  and  study  to  the  area  of 
parameter  estimation.  Over  the  past  several  years,  he  and 
his  thesis  students  have  been  extensively  involved  with  the 
application  of  robust  minimum  distance  estimation  techniques 
to  a  variety  of  distributions.  Some  of  the  distributions 
considered  so  far  are  the  three  parameter  weibull,  the  three 
and  four  parameter  gamma,  the  four  parameter  beta,  the 
generalized  t,  and  the  generalized  exponential  power  distribu¬ 
tion.  Parameter  estimates  from  the  robust  minimum  distance 
estimation  technique  usually  outperform  the  more  classical 
techniques. 


The  objective  of  this  thesis  effort  is  to  estimate 
and  compare  the  parameters  of  the  3-LN  distribution  via  the 
method  of  maximum  likelihood  and  the  minimum  distance  method. 

As  the  name  implies,  the  3-LN  distribution  has  three  para¬ 
meters:  a  location  parameter  and  two  other  parameters 
associated  with  the  mean  and  standard  deviation  of  its  parent 
normal  distribution.  Two  "modified"  maximum  likelihood 
estimation  techniques  are  developed  and  compared  against 
three  minimum  distance  estimation  techniques.  The  procedure 
is  straightforward  and  briefly  stated  here.  Estimates 
obtained  by  the  "modified"  method  of  maximum  likelihood  are 
used  by  the  minimum  distance  method  to  obtain  a  new  estimate 
of  the  location  parameter.  This  new  location  parameter 
estimate  is  then  used  by  the  method  of  maximum  likelihood  to 
obtain  new  estimates  for  the  remaining  two  parameters. 

Lastly,  three  measures  of  effectiveness  are  applied  to  the 
parameter  estimates  to  determine  which  estimation  method 
gives  the  most  accurate  estimates.  Emphasis  is  placed  on 
small  sample  sizes  since  moderately  large  sample  sizes 
(n  >  20)  are  easier  to  handle  due  to  the  asymptotic  properties. 
It  is  anticipated  that  the  minimum  distance  estimates 
will  be  more  accurate  than  those  of  the  "modified"  maximum 
likelihood  method  for  the  3-LN  distribution. 

The  next  chapter  introduces  the  3-LN  distribution  by 
reporting  its  genesis  and  history,  derivation  and  development, 
and  some  important  statistics.  Chapter  III  is  a  short 
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literature  review  consisting  of  some  classical  and  newer 
estimation  techniques  that  have  been  applied  to  the  3-LN 
distribution.  Chapter  IV  discusses  the  method  of  maximum 
likelihood  revealing  its  history,  properties,  and  some 
methods  used  by  statisticians  and  mathematicians  in  solving 
the  maximum  likelihood  equations.  Chapter  V  presents  the 
minimum  distance  theory  and  method.  Chapter  VI  describes 
the  Monte  Carlo  analysis  procedure  and  computerization  of 
the  estimation  methods.  The  final  chapter.  Chapter  VII, 
reports  the  results  and  conclusions  obtained  from  the  Monte 
Carlo  analysis. 
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II.  Three-Parameter  Lognormal  Distribution 


History 

The  normal  distribution  has  played  a  dominant  role 
in  the  field  of  theoretical  and  applied  statistics  ever  since 
its  development  by  Guass  in  1809.  However,  the  normal  curve 
could  not  provide  an  adequate  representation  of  the  many 
different  distributions  found  in  statistical  practice  so,  by 
the  end  of  the  nineteenth  century,  some  statisticians 
attempted  to  construct  systems  of  frequency  curves  represent¬ 
ing  a  variety  of  distributions.  These  were  commonly  referred 
to  as  "skew  frequency  curves"  (13:149).  Among  the  most 
successful  of  these  systems  were  those  developed  by  K.  Pearson 
in  1895,  F.  Y.  Edgeworth  in  1898,  and  C.  V.  L.  Charlier  in 
1905  (13:149).  Pearson's  system  of  frequency  curves  was 
defined  as  the  solution  to  a  differential  equation  involving 
four  parameters  and  Charlier' s  system  was  defined  by  coeffi¬ 
cients  from  the  expansion  of  derivatives  from  the  normal 
distribution  (13:152).  Edgeworth  called  his  system  the 
"method  of  translation"  in  which  a  function  of  the  observed 
random  variable  was  sought  which  closely  approximated  the 
normal  random  variable.  Normal  theory  was  then  applied  to 
the  transformed  variables.  Although  the  "method  of  transla¬ 
tion"  was  not  widely  accepted  due  to  the  limited  variety  of 
possible  shapes,  it  was  this  system  that  greatly  aided  in 
the  advancement  of  the  lognormal  distribution. 
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The  genesis  of  the  lognormal  distribution  arises 
from  a  theory  of  elementary  errors  combined  by  a  multiplicative 
process  instead  of  an  additive  process  as  in  the  case  of  the 
normal  distribution  (2:2).  In  1879,  Galton  pointed  out  that 
many  situations  in  nature  such  as  growth,  change,  or  even 
death  follow  this  multiplicative  process  and  laid  the  founda¬ 
tion  for  the  development  of  the  lognormal  distribution. 

Galton  stated  that  if  x1,x2,...,xn  are  positive  and  inde¬ 
pendent  random  variables  and 

n 

T  =  n  x.  (2.1) 

n  ■  i 


then 


n 

log (TV  =  S  log (x . )  (2.2) 

n  i=l  1 

and  application  of  the  central  limit  theorem  to  the  random 
variables,  log(x^)  would  result  in  the  distribution  of 
log(Tn)  being  approximated  by  the  unit  normal  distribution 
as  the  sample  size  went  to  infinity.  Thus,  the  distribution 
of  Tn  was  called  lognormal  (12:Ch  14,  113).  D.  McAlister 
was  the  first  to  explicity  develop  some  theory  on  the  log¬ 
normal  distribution  in  1879  by  deriving  its  mean,  median, 
mode,  and  second  moment  (2:3).  In  1903,  Kapteyn  further 
established  the  genesis  of  the  distribution  and  also  developed 
a  crude  machine  for  generating  lognormal  samples  from  a  log¬ 
normal  population.  Wicksel  was  first  to  estimate  the  parameters 
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of  the  distribution  by  employing  the  method  of  moments  and 
was  also  the  first  to  consider  the  case  where  a  simple 
displacement  of  the  variate  rather  than  the  variate  itself 
was  lognormal ly  distributed.  The  parameter  which  assigned 
a  value  to  the  displacement  of  the  variate  from  the  origin 
was  referred  to  as  the  threshold  parameter  and  established 
the  3-LN  distribution. 

Since  the  1930s  applications  of  the  lognormal  distribu 
tion  have  steadily  increased.  It  has  been  applied  in  such 
fields  as  agriculture,  entomology,  metallurgy,  biology, 
economics,  and  reliability  with  much  success.  An  extensive 
list  of  applications  can  be  found  at  the  end  of  Chapter 
Thirteen  in  Johnson  (14)  . 

Probability  Density  Function 

By  using  Edgeworth's  "method  of  translation"  in 
which  a  function  of  the  observed  random  variable  is  sought 
which  closely  approximates  the  normal  random  variable, 

Johnson  states  that  the  transformation  of  a  variable,  say 
x,  to  normality  can  be  accomplished  by  a  function,  say 
f(x),  which  has  a  specialized  form  and  is  made  to  depend  only 
on  a  certain  number  of  parameters.  His  transformation  is 
(13:152) : 

2  -  Y+6*f[^j  (2.3) 

where: 
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f  is  a  monotonic  function  of  x  and  does  not  depend  on  any 
parameter 

<5  and  y  are  shape  parameters 
X  is  a  scale  parameter 
£  is  a  location  parameter 
z  is  unit  normal 

The  3-LN  probability  density  function  (pdf)  is  explicity 
derived  by  allowing  the  logarithmic  function  to  represent 
the  function  of  the  transformation  in  Eq  (2.3).  When  natural 
logarithms  are  used,  as  in  this  thesis,  the  scale  parameter 
can  be  removed  (2:6).  Therefore,  Eq  (2.3)  becomes 

z  =  y+6.1n(x-£;)  (2.4) 

Application  of  Eq  (2.4)  to  the  general  form  of  a  normal  pdf 
yields  the  pdf  of  the  displaced  lognormal  variates. 

<5  [-(y+iSln  (x-O  )  2 

£<x)  -  /77<x-l)  exP - 2 -  i*  X  >  5  (2.5) 

»  m 

■  o  if  x  <  e 

Another,  more  common,  expression  of  lognormal  pdf 
uses  the  mean  (y)  and  the  standard  deviation  (a)  of  its  parent 
normal  distribution  and  is  obtained  by  allowing  y  =  -y/<5  and 
a  »  1/5  .  The  pdf  becomes: 


f  (X)  = 


a/ZrT  (x-U 


exp 


-(ln(x-Q-y) 2 


2a1 


if  x  >  5  (2.6) 


=  0  if  x  <  £ 


Figure  1  contains  two  3-LN  plots  with  different 
values  of  y  and  a  . 


Distribution  Function 

The  distribution  function,  commonly  referred  to  as 
the  cumulative  distribution  function  (cdf) ,  of  a  continuous 
random  variable  is  found  by  integrating  its  respective  pdf 
over  a  given  range.  Unfortunately,  the  cdf  for  the  3-LN 
distribution  does  not  have  a  closed  form  solution  so  it  must 
be  integrated  by  numerical  means.  The  cdf,  F  (x) ,  for  the 
3-LN  distribution  is  defined  as  (32:47) : 


F  (x) 


- (ln(u) -u) 2 
2c2 


du 


(2.7) 


where  u  =  x-£ 

Important  Statistics 

Using  the  3-LN  pdf  defined  by  Eq  (2.6)  ,  a  list  of  some 
important  statistics  are  given  below.  The  measures  of  central 
tendency  are  the  mean,  the  median,  and  the  mode  defined 
respectively  as: 
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mean  = 

exp[y+l/2(o2)  ]+£ 

(2.8) 

median  = 

expfy] +£ 

(2.9) 

mode  = 

exp[y-a2]+C 

(2.10) 

Note  that  mean 

The  r  moment 

E^(x-£)rj  = 

>  median  >  mode. 

about  £  is: 

exp  • r+1/2 (r 2 • a2 ) "j 

(2.11) 

The  measures  of  dispersion  are  the  variance  (02) ,  skewness 
(K 3 )  and  kurtosis  (K4)  defined  respectively  by: 


02  =  exp[2y+c2]  •  [exp(o2)-l] 
1 

K3  =  (w+2)*(w-l)  2 

K„  =  w4+2w3+3w2-3 

where  w  «  exp[o2] 

The  standardized  100  a%  deviate  is 

Xa-E [X] 
a  "  c(X) 

where 

Xa  3  exp[v+w]+5 
E [X]  *  mean 


(2.12) 

(2.13) 

(2.14) 


(2.15) 
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<y(X)  =  0 


Ua  =  a-  percentile  from  unit  normal  distribution 

An  interesting  and  easily  verified  fact  is  that  as  a  goes  to 
zero,  the  standardized  lognormal  distribution  approaches  the 
unit  normal  distribution  (see  Figure  1) . 
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III.  Literature  Review 


In  searching  through  the  literature  for  different 
estimation  techniques  that  have  been  applied  to  the  3-LN 
distribution,  several  estimation  techniques  applicable  only 
to  the  two  parameter  lognormal  distribution  were  encountered. 
It  was  interesting  to  note  that  the  estimation  techniques 
for  the  two  parameter  lognormal  distribution  were  usually 
straightforward  and  the  estimates  computed  with  relative 
ease.  However,  with  the  introduction  of  the  single  location 
parameter,  resulting  in  the  3-LN  distribution,  the  estimation 
techniques  become  quite  complicated  and  inaccurate.  This 
finding  is  due  to  the  fact  that  estimation  of  the  location 
parameter  is  extremely  difficult. 

Classical  Estimation  Techniques 

The  most  popular  classical  estimation  techniques, 
as  applied  to  the  3-LN  distribution,  are  described  in  the 
following  paragraphs. 

The  method  of  moments  was  formulated  by  K.  Pearson 
in  1894  and  is  extremely  simple  in  concept.  Suppose  that 
the  pdf  of  a  random  variable,  say  x,  is  a  function  of  h 
unknown  parameters,  say  0i,  02,  . ..,  9^,  then  by  equating  as 
many  sample  moments  to  population  moments  as  there  are 
unknown  parameters,  the  equations  can  be  solved  for 
0i,  02,  ...,  0^,  thereby  yielding  the  parameter  estimates 
(7:130).  To  avoid  any  ambiguity,  the  definition  of  a  moment 
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is  pursued.  There  are  two  types  of  moments  related  to  both 
the  sample  and  population:  a  moment  about  the  origin  called 
a  moment  and  a  moment  about  the  mean  called  the  central 
moment.  The  kfc^  sample  moment,  m'k  ,  is  computed  from  the 
sample  as  follows: 


i  n  y 

-  1  x/ 
n  i=1  1 


(3.1) 


where  n  is  the  sample  size  and  x^  denotes  the  i^*1  observation. 
The  population  moment,  y*k  ,  is  derived  from  the  population 
as 


y'k  =  E[xk]  (3.2) 

The  kth  sample  central  moment,  m^  ,  is  ralculated  from  the 
sample  by 


n 

E  (x.-x)k 
i=l 


(3.3) 


where  x  is  the  sample  mean.  Finally,  the  kth  population 
central  moment,  y^,  is  derived  from  the  population  as 


yk  “ 


E 


[(x-y)k 


(3.4) 


where  y  *  E[X]  is  the  mean  of  the  distribution.  Since  the 
population  moment  is  a  function  of  the  population  parameters 
setting  up  the  equations 


y  k  *  m'k  k  »  l, . . .  ,p 


(3.5) 
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where  p  is  the  number  of  parameters  being  estimated, 
establishes  a  system  of  p  equations  and  p  unknowns.  These  p 
unknowns  may  be  solved  for  yielding  the  method  of  moments 
estimators: 


a  =  An  (1+u2)  (3.6) 

y  =  -|[ln(m2)  -ln(u2  (1+u2) )]  (3.7) 

^  I 

l  -  -  e^  (1+u2 )  2"  (3.8) 

where  u  is  the  solution  to 

i_ 

u3  +  3u  =  (w-l)2(w+2)  (3.9) 

and  w  =  exp£a2j 


From  the  sample  and  population  central  moments  skewness  and 
kurtosis,  two  very  useful  statistics,  denoted  by  K3  and  K4 
respectively,  may  be  obtained  by 


SLa 

(m2)  ^ 


K3 


Us 

<U2>  ^ 


k4 


m4 

(m2) 2 


k4 


- 

(  U  2  )  2 


(3.10) 

(3.11) 

(3.12) 

(3.13) 
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The  'hat'  (~)  is  used  to  identify  the  sample  statistics. 

In  the  case  of  the  3-LN  distribution,  Eqs  (3.11)  and  (3.13) 
yield  Eqs  (2.13)  and  (2.14)  respectively.  Equations  (3.10) 
and  (3.11)  are  used  later  in  the  thesis  to  provide  an 
initial  estimate  for  the  a  parameter.  Wicksell,  Gumble, 

Yuan,  and  Aithcison  and  Brown  have  all  applied  the  method  of 
moments  to  the  3-LN  distribution.  The  latter  found  that  the 
method  of  moments  is  very  inefficient  when  compared  to  other 
estimation  methods  (14:Ch  14,124).  This  may  be  due  to  the 
fact  that  the  moment  sequence, },  of  the  lognormal  distribu¬ 
tion  is  not  unique  to  the  distribution  and,  therefore,  cannot 
be  defined  by  its  moments  (14:Ch  14,  115).  Thus,  the  method 
of  moments  is  presented  but  not  employed  in  this  thesis. 

The  method  of  quantiles  was  also  applied  to  the  3-LN 
distribution  to  estimate  the  parameters.  This  method  results 
in  three  quantiles,  say  q,  1/2,  1-q,  where  0  <  q  <  1/2. 

It  is  clear  from  Eq  (2.4)  that  the  value  of  x  such  that 
P [x  <  Xg]  =  q  is  related  to  the  corresponding  percentile, 

1/q,  of  the  unit  normal  distribution  by  x  =  £+exp[y+v  -a] 
and  that  the  median  of  the  lognormal  sample  (corresponding 
to  q  =  1/2  is  £+exp[y]  .  Denoting  the  sample  quantile 
of  order  q  by  x^  and  the  quantiles  of  order  q  from  the  unit 
normal  population  by  vg  such  that  v^  ^  =  “vq  =  v  an<* 
after  some  rearranging,  results  in  the  following  set  of 
estimators: 
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a  =  i|ln(x.  -x i ) -In (xi-x) 

VL  1_q  7  T  <3 

(3.14) 

y  =  ln(xi-x  ) -In (1-exp [-voj ) 

2  *3 

(3.15) 

c  =  x_i_— exp  [y  ] 

(3.16) 

2 


and  explicit  estimates  for  y,  cr,  and  £  are  obtained  (2:58). 
Aitchison  states  that  as  a  general  rule,  q  should  be  set 
at  0.05  (2:58) . 

A  final  and  very  interesting  method  of  parameter 
estimation  was  developed  by  W.  F.  F.  Kemsley  in  1952  which 
uses  a  mixture  of  the  method  of  moments  and  quantiles.  He 
equates  the  sample  mean  x  to  the  population  mean  which  is 
equivalent  to  replacing  x^  in  Eqs  (3.14),  (3.15),  and  (3.16) 
by  x  and  derives  the  function: 

x— x 

f(o2)  =  - - §=  (3.17) 

xl-q  x 

from  which  o  is  determined.  The  estimates  for  y  and  £  are 
then  easily  found  by  inserting  the  value  of  a  in  Eq  (3.15), 
solving  for  y,  then  solving  for  £  using  y  in  Eq  (3.16). 

Again,  a  general  rule  is  to  let  q  ■  0.05 

In  1957,  Aitchison  and  Brown  (2)  compared  these  three 
estimation  methods  and  reached  the  conclusion  that,  with 
q  *  0.05  ,  the  method  of  quantiles  was  better  than  Kemsley' s 

method,  and  that  both  were  considerably  better  than  the 
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method  of  moments  (2:62-63).  Unfortunately,  Aitchison  and 
Brown  did  not  apply  the  method  of  maximum  likelihood  to  the 
3-LN  distribution,  but  they  did  apply  it  to  the  two  para¬ 
meter  lognormal  distribution  and  found  that  the  maximum 
likelihood  estimates  were  far  more  accurate  than  the 
estimates  obtained  by  the  method  of  moments  or  quantiles 
(2:53).  With  this  in  mind,  it  is  anticipated  that  the 
method  of  maximum  likelihood  should  also  perform  better  than 
the  method  of  quantiles  for  the  3-LN  distribution.  Thus, 
the  method  of  maximum  likelihood  is  used  in  this  thesis 
and  is  extensively  discussed  in  Chapter  IV. 

New  Estimation  Techniques 

In  searching  through  the  recent  literature,  several 
articles  pertaining  to  parameter  estimation  of  the  3-LN 
distribution  were  discovered.  Dallas  R.  Wingo  (28)  estimated 
the  parameters  by  taking  the  conditional  log-likelihood  func¬ 
tion  of  an  ordered  random  sample  of  n  independent  observa¬ 
tions,  subject  to  several  linear  constraints  and  applied  a 
nonlinear  program  to  obtain  the  estimates.  Using  these 
estimates  in  a  series  of  penalty  functions  and  maximizing 
these  functions  for  a  sequence  of  decreasing  values  of  the 
penalty  parameter,  he  asserts  that  the  corresponding  solu¬ 
tion  sequences  converges  to  the  solutions  of  the  nonlinear 
program.  Thus,  estimates  of  y,  a,  and  £  are  obtained.  This 
estimation  technique  was  applied  to  several  lognormal  data 
sets  which  suggested  that  the  algorithm  is  quite  practical  and 
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easy  to  implement.  However,  a  major  problem  with  Wingo's 
approach  is  that  the  control  parameter  is  arbitrarily  chosen 
which  presents  several  difficulties.  If  the  control  para¬ 
meter  is  initially  too  large,  the  penalty  function  will  be 
easy  to  maximize,  but  the  algorithm  may  converge  to  a  point 
that  is  not  the  solution  to  the  constrained  problem  (28:58). 

On  the  other  hand,  if  the  control  parameter  is  too  small,  the 
penalty  function  becomes  very  difficult  to  maximize  (28:59). 

In  1970,  Munro  and  Wixley  (21)  estimated  the  para¬ 
meters  based  on  order  statistics.  In  their  approach,  they 
take  the  expectations  of  the  order  statistics  and  express  them 
as  functions  which  are  linear  in  a  location  and  scale  para¬ 
meter  but  nonlinear  in  a  shape  parameter  of  the  distribution. 
Regressing  the  order  statistics  on  their  expectations  and 
implementing  an  iterative  technique  results  in  weighted  least 
squares  estimates  of  all  three  parameters  (21:212).  A  Monte 
Carlo  analysis  of  this  methodology  revealed  that  the  iterative 
procedure  always  converges,  nearly  unbiased  estimates  are 
obtained,  and  the  variances  for  the  location  and  scale  para¬ 
meter  compare  favorably  with  those  of  the  order  statistic 
estimates  of  the  standard  normal  distribution  (21:222). 

A  variant  on  Wingo's  method  was  developed  by  Gibbons 
and  McDonald  (9)  in  1975.  Using  the  expected  values  and 
covariance  matrix  of  the  standardized  2-LN  distribution, 
they  obtain  the  best  linear  unbiased  estimators  (BLUEs)  for 
£  and  a.  The  BLUEs  fair  well  against  the  maximum  likelihood 
estimates  for  sample  sizes  of  five  and  under.  However,  their 
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method  is  severely  limited  as  it  can  only  be  applied  when 
the  shape  parameter  is  fixed  at  unity  (9:290). 

In  addition  to  these  parameter  estimation  techniques, 
several  variations  of  the  method  of  maximum  likelihood  have 
been  applied  to  the  3-LN  distribution.  As  such,  these 
variations  are  discussed  in  the  chapter  describing  the 
method  of  maximum  likelihood. 
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IV.  Method  of  Maximum  Likelihood 

By  far  the  most  popular  method  of  parameter  estima¬ 
tion  has  been  and  still  is  the  method  of  maximum  likelihood. 

The  concept  of  maximum  likelihood  was  proposed  by  Daniel 
Bernoulli  in  1778  and  was  later  used  by  Guass  in  his  develop¬ 
ment  of  the  theory  of  least  squares  in  1796  (7:135).  Gauss 
had  used  the  method  of  maximum  likelihood  and  felt  that  such 
estimation  was  inferior  to  least  squares  estimation.  As  a 
result,  the  development  of  the  maximum  likelihood  method  was 
overlooked  until  R.  A.  Fisher  reintroduced  the  method  in  1912. 
Since  that  time,  the  maximum  likelihood  method  has  been  applied 
to  numerous  distributions  and  has  enjoyed  widespread  success. 

As  will  be  discussed  later,  the  maximum  likelihood  estimators 
(MLEs)  possess  several  desirable  properties  such  as 
asymptotic  unbiasedness,  asymptotic  efficiency,  consistency, 
sufficiency,  and  invariancy.  It  is  mostly  due  to  these 
properties  that  the  maximum  likelihood  method  is  used  in 
this  thesis. 

The  principle  of  maximum  likelihood  consists  in 
accepting  as  the  best  estimate  of  the  parameters,  say 
0i,  02,  •••»  those  values  of  the  parameters  which 
maximize  the  likelihood  for  a  given  set  of  observa¬ 
tions  say,  xi,  x2,  ...,  xn  [24:150]. 

Understanding  what  is  meant  by  ".  .  .  maximize  the  likelihood 
for  a  given  set  of  observations  [24:150]"  requires  the  intro¬ 
duction  of  the  likelihood  function  and  its  definition. 
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Let  xi,  x2,  . ..,  xn  be  a  set  of  independent  random 
samples  taken  from  a  population  described  by  the  density 
function  fn(xi,  x2,  ...»  xn/0i,  82,  ...,  8k)  where 
9x,  02,  . ..,  9k  are  unknown  parameters  of  the  density 
function.  The  likelihood  function,  L,  is  then  defined  by: 


L  L  (x  i ,  x2  ,  «  •  •  ,  Xj^/0\,  02,  •  •  •  ,  0k ) 
f  n  (x2 ,  ^2 1  •••/  6 1  f  0  2 ,  •».,  0]^) 

=  f(Xi/01#  02,  ...,  0k)  *f  (x2/0i ,  02, 

•  •  •  ,  £ (xn/ ® 1 >  0  2  r 


n 

=  n  f . (x. /0i,  02 , 
i=l  1  1 


®k) 


6k>  r 

0k)  (4.1) 


(20:278)  . 

The  likelihood  function  gives  the  "likelihood"  that  the 
random  variables  assume  a  particular  value  of  xi,  x2 ,  ...,  xn. 
In  other  words,  it  is  a  means  of  determining  from  which 
density  function  a  set  of  values  would  most  likely  have  come 
from.  Maximization  of  the  likelihood  function  yields  the 
estimated  values  0 i ,  0  2 ,  . . . ,  §k  for  the  unknown  parameters 
and  are  called  the  maximum  likelihood  estimators  of 
0i,  02,  ...»  0k  respectively.  The  maximization  process  is 
accomplished  by  taking  the  partial  derivatives  of  the  likeli¬ 
hood  function  with  respect  to  each  unknown  parameter,  setting 
the  equations  equal  to  zero  and  then  solving  for  the  para¬ 
meters. 
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(4.2) 


yjj-  =  0  i  =  1,2,3, ...,k 
i 

Since  L  and  log(L)  attain  their  maximum  at  the  same  value  of 
9^,  it  is  usually  computationally  more  efficient  to  transform 
L  into  log(L)  as  this  transforms  the  product  into  a  sum  which 
facilitates  differentiation. 

Properties 

The  widespread  use  of  the  method  of  maximum  likeli¬ 
hood  is  attributed  to  the  resulting  estimators  being 
asymptotically  unbiased,  asymptotically  efficient,  consistent, 
sufficient,  and  invariant.  The  estimator,  0,  is  said  to  be 
unbiased  if 

E [91  *  0  (4.3) 

the  expected  value  of  the  estimator  is  equal  to  the  estimate. 

Or  simply  stated,  §  is  unbiased  if  the  mean  of  its  distribution 
equals,  f(9),  the  function  of  the  parameter  being  estimated 
(20:293) .  Efficiency,  mainly  a  large  sample  concept,  is 
linked  with  the  smallest  asymptotic  variance  among  a  class  of 
estimators  which  yields  rapid  convergence  for  the  estimator 
(20:155).  An  alternate  definition  is  due  to  the  Cramer-Roa 
Inequality  which  sets  a  lower  bound  on  the  variance  of  the 
estimate.  If  equality  at  the  lowest  bound  holds,  then  the 
estimator  is  said  to  be  an  efficient  estimator  (7:139).  The 
property  of  consistency  implies  that  the  accuracy  of  the 
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estimate  increases  as  the  sample  size  increases  (24:151). 
Sufficiency  merely  indicates  that  the  maximum  likelihood 
estimates  contain  all  the  information  about  the  unknown 
parameters  that  is  contained  in  the  seunple  and  that  the 
conditional  distribution  of  the  sample  given  the  value  of 
the  statistic  does  not  depend  on  the  parameters  (20:301). 

A  final  and  important  property  of  the  MLEs  is  the  invariance 
property  defined  as  "Let  9  be  the  MLE  of  0  in  the  density 
function  f(xi,  X2,  . ..,  xn) .  If  t(*)  is  a  function  with  a 
single  valued  inverse,  then  the  MLE  of  t(0)  is  x(9)  [20:284]." 

For  example,  it  is  well  known  that  the  MLE  of  the  variance 
of  a  normal  density  function  is 

a2 ( 0)  =  ;?  (xi-y o ) 2  (4.4) 

ni=l 


where  y0  is  the  mean  of  the  distribution.  Then  by  the 
invariance  property,  the  MLE  of  a  is 


a 


(x^y  o) 


2 


(4.5) 


or  similarly,  the  MLE  of  log (a2)  is 


log(o2)  =  log 


(4.6) 


This  property  is  extremely  convenient  when  transformations  are 
required. 
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Three -Parameter  Lognormal  Estimation 

In  searching  through  the  literature,  only  five 
articles  dealing  with  maximum  likelihood  estimation  of  the 
3-LN  distribution  were  uncovered.  The  first  attempt  was 
by  E.  Wilson  and  J.  Worchester  in  1945,  followed  by  A.  C. 
Cohen  in  1951,  B.  M.  Hill  in  1963,  J.  A.  Lambert  in  1964, 
and  finally  H.  Harter  and  A.  Moore  in  1966.  The  following 
paragraphs  present  a  short  summary  of  each  methodology  along 
with  the  maximum  likelihood  equations. 

Employing  the  method  of  maximum  likelihood,  as 
previously  described,  results  in  the  maximum  likelihood 
equation  for  each  parameter  being: 


y 


n  [ln(Xj-£)  ] 
Z  - 


(4.7) 


n  -  n  InUi-O 

(ff2-v>  Z  ^  : - 0 

i=lxi  i=i  ? 


(4.9) 


(6:207).  It  is  clear  that  if  any  of  these  equations  are 
solved  for  £,  the  remaining  parameters  could  be  easily 
obtained  by  algebraic  manipulation.  Unfortunately,  Eq  (4.9) 
cannot  be  solved  explicity  for  thus,  herein  lies  the 
difficulty  of  parameter  estimation  for  the  3-LN  distribution. 
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Wilson  and  Worchester  attempted  to  solve  the  maximum 
likelihood  equations  by  a  trial  and  error  method.  This 
resulted  in  computational  inefficiency  and  extremely  poor 
parameter  estimates  as  could  be  expected. 

A.  C.  Cohen  (6)  presented  a  more  feasible  and 
efficient  technique  for  solving  the  maximum  likelihood 
equations.  He  substituted  Eqs  (4.7)  and  (4.8)  into  Eq  (4.9) 
yielding  a  function,  f(£)  of  the  location  parameter.  The 
solution  of  this  function  is  found  by  inverse  interpolation 
on  a  small  interval  say  (£i,  £2)  where  f  ( C i. )  <  0  .  The 
estimated  value,  %,  is  then  substituted  into  Eqs  (4.7)  and 
(4.8)  resulting  in  y  and  a.  He  also  uses  a  technique  based 
on  the  least  sample  value  to  estimate  £.  This  technique 
summarized  below  requires: 


xq-E,  =  exp[y+0*to] 


(4.10) 


where  x0  =  Xi  +  _  with  xi  being  the  least  sample  observation, 
6  the  interval  of  precision,  and  t0  determined  from  the 
relationship 


k 

n 


to 


— /  exp 
/2? 


—  00 


(4.11) 


in  which  k  is  the  number  of  times  the  least  observed  value 
occurs  in  the  sample  (6:209).  Taking  the  natural  logarithms 
on  both  sides  of  Eq  (4.10)  and  substituting  Eqs  (4.7)  and 
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(4.8)  into  the  logarithm  of  Eq  (4.10)  gives  a  function, 
f  (£)  of  the  location  parameter  equivalent  to  the  maximum 
likelihood  equation  for  £.  A  Monte  Carlo  analysis  of  these 
techniques  showed  that  the  method  based  on  the  least  observed 
sample  values  performs  better  than  the  inverse  interpolation 
method . 

A  significant  fact  with  respect  to  the  maximum  likeli¬ 
hood  estimation  of  the  3-LN  distribution  was  reported  by 
B.  M.  Hill  in  1963.  He  proved  that  there  exists  a  path, 
henceforth  referred  to  as  the  "path  of  no  return,"  along  which 
the  likelihood  function,  L (£,y(£),  a2(£))  goes  to  °°  as  £  goes 
to  Xi,  the  least  observed  sample  value,  and  goes  to  a  positive 
constant  as  £  goes  to  (11:73).  This  leads  to  the 
ridiculous  statement  that  the  maximum  likelihood  estimates  of 
S,  y,  and  a2  are  xi,  -00,  and  +°°  respectively  (11:75).  To 
overcome  this  difficulty,  he  introduces  a  joint  prior 
distribution  for  £,  y,  and  a  then  applies  Bayes  Theorem. 

This  approach  leads  to  the  conclusion  that  the  solution  to 
the  formal  likelihood  equations  should  be  used  with  t 
satisfying 

=  0  (4.12) 


n 


E  (x.-C)  + 
i=l  1  o(£) 


n 

E 


i=l  Ui-S) 


where 


(ln(xj-£)-y(U  ) 

.  A 

aU) 


(4.13) 
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and  {1(C),  o(£)  satisfying  Eqs  (4.7)  and  (4.8)  respectively 
with  £  being  replaced  by  £  (14:Ch  14,  132). 

In  1964,  Lambert  (16)  conducted  an  empirical  investiga¬ 
tion  of  Hill's  proposed  estimation  methodology.  Using 
twenty-three  different  samples,  each  with  different  values 
for  the  parameters  (£  was  always  zero),  he  found  the 
estimates  fairly  inaccurate  in  a  majority  of  the  samples. 

Also,  when  a2  was  less  than  0.04  or  larger  than  4.0,  the  pro¬ 
cedure  had  a  tendency  to  diverge. 

The  final  article  found  on  maximum  likelihood  estima¬ 
tion  of  the  3-LN  distribution  was  coauthoried  by  Drs  H. 

Harter  and  A.  Moore  (10)  in  1966.  Recognizing  that  an 
algebraic  solution  to  the  maximum  likelihood  equations  is 
impossible  and  that  the  likelihood  function  may  take  the 
"path  of  no  return"  which  yields  ridiculous  estimates,  as 
proven  by  Hill,  they  develop  an  iterative  technique  to  solve 
the  maximum  likelihood  equations.  When  the  likelihood  func¬ 
tion  gets  on  the  "path  of  no  return,"  a  modification  of  the 
technique  is  employed  which  circumvents  this  problem.  Their 
technique  is  very  flexible  in  that  it  allows  for  samples 
censored  both  from  above  and/or  below.  A  Monte  Carlo  analysis 
was  conducted  with  various  parameter  values,  sample  sizes,  and 
censorings.  In  cases  of  no  censoring,  and  all  parameters 
unknown,  the  mean  of  each  estimator  is  very  close  to  the  true 
parameter  values  with  the  corresponding  variances  being  very 
small.  These  values  indicate  that  the  iterative  technique 
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provides  good  estimates  of  the  parameters.  Since  this  tech¬ 
nique  has  been  validated,  verified,  and  yields  accurate 
estimates  of  the  parameters,  a  modification  of  this  method 
is  used  in  this  thesis  as  one  of  the  bases  for  determining 
the  parameter  estimates  by  the  method  of  maximum  likelihood. 

The  iterative  procedure  is  straightforward  and 
involves  estimating  the  three  parameters,  one  at  a  time  in 
the  cyclical  order  y,  a,  and  £  assuming  all  parameters 
unknown.  First,  the  observations  are  ordered  and  initial 
estimates  chosen.  The  initial  estimate  of  £  is  chosen  as 
the  first  order  statistic.  Secondly,  the  iterative  technique 
procedure  begins  where,  at  each  step,  the  rule  of  false 
position  (iterative  linear  interpolation)  is  employed  to 
determine  if  the  value  of  the  parameter  being  currently 
estimated  satisfies  its  respective  maximum  likelihood  equation 
given  the  latest  and/or  known  values  of  the  other  parameters 
(10:848).  The  possibility  of  encountering  the  "path  of  no 
return"  occurs  when  no  value  of  £  in  the  permissible  interval 
£  <  Xi  satisfies  Eq  (4.9).  In  these  cases,  the  likelihood 
function  is  monotonically  increasing  so  that  l  =  xx  , 
y-*- ,  and  a-'-t00  (10:848).  Fortunately,  the  modified 

procedure  alleviates  this  problem  by  censoring  the  smallest 
observation  and  all  those  equal  to  it.  The  initial  estimate 
of  £  is  then  equated  to  the  smallest  uncensored  observation 
and  the  iterative  procedure  is  reapplied.  The  censored 
observation (s)  is  subsequently  not  considered  in  the 
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estimation  procedure,  but  it  does  become  an  upper  bound  on 
X .  With  the  likelihood  function  bounded,  finite  estimates 
of  the  remaining  two  parameters  can  be  calculated.  Harter 
and  Moore  suggest  the  iteration  continue  until  either  the 
results  from  successive  steps  agree  with  some  assigned 
tolerances  or  a  specified  number  of  steps  is  reached. 
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V.  The  Minimum  Distance  Method 

Minimum  distance  (MD)  estimation  is  a  relatively  new 
development  in  the  field  of  statistical  parameter  estimation. 
It  was  first  introduced  in  the  1950s  by  J.  Wofowitz  as  a 
method  which  .  .  in  a  wide  variety  of  cases,  will  furnish 
super  consistent  estimators  even  when  classical  methods  .  .  . 
fail  to  give  consistent  estimators  [30:9]."  In  1957,  he 
published  a  fundamental  paper  proving  the  consistency  of  the 
estimator  along  with  several  examples  of  its  use.  Basically, 
Wolfowitz's  technique  was  to  let  <S(Fi,F2)  =  sup [Fx (x) -F2  (x) ] 
be  a  measure  of  the  discrepancy  between  two  distribution 
functions,  and  emphasized  the  applicability  of  this  method 
for  a  broad  range  of  distance  techniques.  From  that  time  on, 
the  minimum  distance  method  slowly  evolved.  Working  with 
absolutely  continuous  distributions,  Blackman  (4)  proved 
consistency  and  asymptotic  normality  for  an  estimator  of 
location  via  the  MD-method.  In  1969,  Knusel  (15)  investigated 
the  "robustness"  of  MD  estimators  and  showed  that  they 
exhibit  the  same  properties  as  the  class  of  maximum  likeli¬ 
hood  type  estimators  (M-estimators) .  tae  term  "robustness" 
refers  to  the  ability  of  an  estimator  to  adopt  to  deviations 
in  the  underlying  model  while  remaining  efficient  (23:3). 

Then,  in  1970,  Sahler  contributed  a  major  piece  of  work  in 
the  area  of  minimum  distance  estimation.  As  Parr  described  it 
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Sahler  systematically  defines  D-estimators 
[minimum  distance  type  estimators]  and  outlines 
conditions  for  their  existence  and  consistency. 

In  particular,  he  considers  discrepancies  of 
integral  type,  and  proves  an  asymptotic  normality 
result  for  the  general  (unidimensional)  parameter 
estimation  [22:9]. 

It  has  not  been  until  recently  that  statisticians 
began  applying  minimum  distance  techniques  for  the  estimation 
of  distribution  parameters.  In  particular,  Parr  and  Schucany 
(23)  applied  the  MD  method  to  estimate  the  location  parameters 
of  several  symmetric  distributions  with  emphasis  placed  on 
the  normal  distribution.  They  concluded  that  the  method 
yielded  "...  strongly  consistent  estimators  with  excellent 
robustness  properties  [23:5]."  The  MD-method  has  also  been 
applied,  via  Monte  Carlo  analysis,  to  a  variety  of  symmetrical 
and  nonsymmetrical  distributions  at  the  Air  Force  Institute 
of  Technology  as  theses  research  under  the  supervision  of 
Dr  Albert  H.  Moore.  Most  of  the  theses  have  concluded  that 
the  MD-estimators  provide  more  accurate  parameter  estimates 
than  classical  estimation  methods,  even  the  method  of  maximum 
likelihood.  This  thesis  is  yet  another  application  of  the 
MD-method  to  another  distribution:  the  3-LN  distribution. 
Results  from  the  Monte  Carlo  analysis  should  prove  quite 
interesting  in  this  case.  Mos+  '■v  the  other  applications 
involved  symmetrical  or  near-symmetrical  distributions,  of 
which  the  3-LN  distribution  is  usually  neither.  The  shape 
of  this  distribution  may  take  on  a  variety  of  forms  ranging 
from  positively  to  negatively  skewed  and  very  peaked  to  not 
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very  peaked  depending  on  the  parameter  values.  It  will  be 
interesting  to  investigate  how  well  the  MD-estimators  react 
under  the  variety  of  distribution  shapes.  Nevertheless,  it 
is  anticipated  that  the  MD-estimators  will  provide  more 
accurate  parameter  estimates  than  the  maximum  likelihood 
estimates . 

The  minimum  distance  technique  is  an  extension  of 
the  goodness-of-f it  tests  used  in  hypothesis  testing.  In 
goodness-of-f it  testing,  the  analyst  is  concerned  with 
determining  whether  or  not  a  random  variable  follows  a 
particular  distribution  given  certain  values  of  the  parameters. 
This  testing  is  accomplished  by  constructing  a  distribution 
function,  call  it  F(x:0)  where  G  represents  known  parameter 
values,  and  then  determining  how  well  it  fits  the  sample 
distribution  function  referred  to  as -the  empirical  distribu¬ 
tion  function  or  EDF .  The  common  measure  of  fit  is  usually 
the  distance  between  F(x:0)  and  the  EDF.  The  most  popular 
EDF  is  a  nondecreasing  step  function  of  size  1/n,  where  n 
is  the  sample  size,  corresponding  to  the  ordered  sample 
points.  This  EDF  convention  is  employed  in  this  thesis. 

The  minimum  distance  method  is  somewhat  a  reverse  of  the 

* 

goodness-of-f it  philosophy  in  that  the  analyst  assumes  the 
probability  density  function  of  a  random  variable  and  attempts 
to  estimate  its  parameters.  Minimum  distance  estimation 
merely  takes  as  its  estimates  of  0  those  values  which  minimize 


the  discrepancy  between  F(x:0)  and  the  EDF.  See  Figure  2 
for  a  plot  of  an  estimated  cdf  against  its  EDF. 

Further  discussion  of  MD  concepts  necessitates  the 
use  of  some  notation  which  will  be  utilized  throughout  this 
chapter.  Let  r  =  {FQ(*)/9eQ}  represent  the  parametric 
family  of  three  parameter  lognormal  distributions.  Fq(*) 
defines  the  estimated  3-LN  distribution  using  the  parameter 
estimates  found  by  the  maximum  likelihood  method  and 
corresponds  to  the  parameter  space.  Let  Gn(*)  denote  the 
EDF  based  on  a  random  sample  from  the  true  but  unknown 
distribution  function  G(*).  The  EDF  is  a  nondecreasing  step 
function  of  size  1/n  at  each  point  xi,  x2,  . ..,  xn  as  pre¬ 
viously  defined  and  has  a  value  of  one  at  infinity  (30:10). 
Finally,  6(Gn,  F0)  denotes  the  discrepancy  between  the  two 
distribution  functions.  Thus,  the  "D-estimators"  mentioned 
in  Parr's  quote  on  page  33  is  the  "...  value  in  ft 
minimizing  (Gn,  FQ)  over  fi  [22:6]." 

Since  the  distance  between  two  distribution  functions 
is  used  in  determining  the  parameter  estimates,  the  distance 
measure  is  of  utmost  importance.  In  their  paper,  Parr  and 
Schucany  (23)  describe  the  most  commonly  applied  distance 
measures,  and  it  is  from  this  paper  that  most  of  the  following 
definitions  are  taken  (23:4-5). 


35 


Fig  ure  2.  EDF  vcrauo  Eet 


Weighted  Kolmogrov  Distance 

The  weighted  Kolmogrov  distance  is  defined  as: 

D^(Gn,FQ)  =  supx|Gn(x)-FQ(x)  |iMF0(x)  )  (5.1) 

This  is  one  of  the  most  popular  distance  measures  as  a  result 
of  its  use  in  the  Kolmogrov-Smirnov  goodness-of-f it  tests. 

The  discrepancy  measure  minimizes  the  maximum  distance 
between  Gn(x)  and  F0 (x)  when  evaluated  at  each  of  the  sample 
points.  iHFqM)  is  a  weighting  function  which  allows  the 
user  to  assign  a  weight  to  each  observation.  In  this  thesis, 
uniform  weights  of  1.0  are  assigned  to  each  observation. 

Weighted  Cramer-von  Mises  Distance 

The  weighted  Cramer-von  Mises  distance  is  defined  as: 

+oo 

WJ(Gn,Fe)  =  /(Gn(x)-FQ(x))2iMF0(x))dFQ(x)  (5.2) 

—  00 

This  technique  minimizes  the  discrepancy  between  the  theoretical 
and  empirical  distribution  function,  whereas  the  Kolmogrov 
distance  only  finds  the  absolute  differences  between  the  two. 

The  weighting  function  is  set  equal  to  one  resulting  in  the 
Cramer-von  Mises  statistic  (CVM) . 

Anderson-Darling  Statistic 

The  Anderson-Darling  statistic  is  defined  as 

+°°  . 

A?(Gn,F0)  =  /(Gn(x)-F0(x))7T^I)dF0(x)  (5.3) 

*  —  00 
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Careful  observation  should  reveal  that  this  statistic  is  a 


special  case  of  the  Cramer-von  Mises  distance  where 

^ (Fg  (x) )  =  - (^_U)  and  u  =  F0 (x)  .  This  weighting  places 

more  emphasis  on  the  tails  of  the  distribution.  The  process 
also  minimizes  the  area  between  the  theoretical  and  empirical 
distribution  functions. 

Kupier 1 s  Maximal  Interval  Probability  Statistic 
The  Kupier  statistic  is  defined  as: 

V(Gn,F0)  =  sup_M<a<b<+jGn(b)-Gn(a))-(FQ(b) 

-F0 (a) ) |  (5.4) 

This  statistic  is  closely  related  to  the  Kolmogrov  distance 
where  the  parameters  a  and  b  define  the  maximal  probability 
interval.  The  method  yields  more  accurate  estimates  for  a 
distribution  scale  parameter  than  for  a  location  parameter. 

Watson  Statistic 

The  Watson  statistic  is  defined  as: 


+  CO 


1+00 


Uz(Gn,F0)  .  / (Gn (x) -F 0 (x) ) zdF0 (x)  + 


/  (Gn(x) 


-F0  (x)df  0  (x) 


(5.5) 


and  is  related  to  the  Cramer-von  Mises  distance.  As  with  the 
Kupier  statistic,  this  technique  is  more  appropriate  for 
estimating  the  distribution  scale  parameter  than  the  location 
parameter. 
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The  discrepancy  measures  listed  represent  some  of  the 
more  popular  D-estimators .  The  popularity  of  these  estimators 
is  attributed  to  their  invariance  property,  similar  to  that 
of  the  maximum  likelihood  estimators. 

This  invariance  is  a  result  of  the  estimator 
not  taking  advantage  of  the  function  g(0)  of  the 
point  6  e  to  be  estimated,  instead  it  simply 
selects  a  best  approximating  distribution  [18:27]. 

Or  as  Parr  stated  it: 

It  may  well  be  inquired  as  to  why  an  estimator 
obtained  by  minimization  of  a  discrepancy  measure 
which  is  useful  in  goodness-of-f it  purposes  (and, 
hence  in  many  cases,  extremely  sensitive  to  out¬ 
liers  or  general  discrepancies  from  the  model) 
should  be  hoped  to  possess  any  desirable  "robustness" 
properties.  It  turns  out  that,  in  most  cases 
(although  no  for,  say.  A2  while  the  discrepancy 
measure  itself  may  be  fairly  sensitive  to  the  pre¬ 
sence  of  outliers,  the  value  8  which  minimizes  the 
discrepancy  6(Gn,  F0)  is  much  less  so  [23].. 

Since  this  thesis  is  concerned  with  the  application 
of  minimum  distance  estimation  to  determine  the  location  para¬ 
meter  for  the  3-LN  distribution,  the  Kolmogrov  distance,  the 
Cramer-von  Mises  statistic,  and  the  Anderson-Darling 
statistic  are  employed. 
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VI .  Monte  Carlo  Analysis 

This  chapter  deals  extensively  with  the  procedures 
used  in  conducting  the  empirical  investigation  and  comparison 
of  the  interpolative  and  iterative  modified  maximum  likeli¬ 
hood  (ML)  estimators  (described  later)  along  with  each  of 
the  minimum  distance  (MD)  estimators.  Specifically,  data 
generation,  computerization  of  estimation  techniques,  and 
evaluation  criteria  are  covered  in  detail.  In  this  research, 
Monte  Carlo  analysis  is  used  to  evaluate  the  properties  of 
the  estimators.  Basically,  Monte  Carlo  analysis  of  estima¬ 
tion  methods  consists  of  the  following  three  steps: 

1.  Generate  independent  random  variables  from  a 
specific  distribution  (the  3-LN  distribution  in  this  case) 
to  form  random  samples  of  a  given  size. 

2.  Apply  the  estimation  method (s)  to  the  random 
samples  to  obtain  the  parameter  estimates. 

3.  Compare  the  estimates  from  each  estimation 
method (s)  using  one  or  more  evaluation  criteria. 

Since  Monte  Carlo  analysis  requires  a  vast  amount  of  data  and 
many  computations,  a  high  speed  computer  is  a  necessity.  For 
the  purposes  of  this  analysis,  the  Control  Data  Corporation 
(CDC)  6600  computer  located  at  the  Aeronautical  Systems 
Division,  Wright-Patterson  Air  Force  Base,  Ohio  is  used. 

Three  programs  are  developed  to  carry  out  this  research.  Each 
program  is  written  in  FORTRAN  V  and  makes  use  of  several 
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subroutines  from  the  International  Mathematical  and  Statistics 
Library  (IMSL)  (12)  .  The  first  program  generates  samples  of 
size  n  from  a  3-LN  distribution  and  calculates  the  inter- 
polative  and  iterative  modified  ML-estimators .  The  second 
program  calculates  the  MD-estimators .  And  the  third  program 
evaluates  the  estimates  from  each  estimation  technique. 

Before  proceeding  further,  some  notation  and  comments  are 
noted: 

1.  Let  po/  do,  and  Co  represent  initial  estimates  for 
y,  a,  and  C • 

A  A  /V 

2.  Let  y,  c,  and  C  represent  the  ML-estimates  of 

y,  c,  and  £• 

A  A  A 

3.  Let  y,  a,  and  C  represent  the  MD-estimates  of 

y,  a,  and  C. 

4.  Let  represent  the  i  order  statistic  from 

a  sample. 

5.  All  estimates  are  calculated  to  three  significant 
digits,  except  for  the  Kolmogrov  estimates  which  are  to  two 
significant  digits. 

6.  Recall  that  in  the  3-LN  pdf  and  cdf  (Eqs  (2.6) 
and  (2.7))  there  is  an  In  (X-C)  term;  therefore,  no  estimate 
of  C  can  be  greater  than  or  equal  to  the  first  order 
statistic  (X^j),  otherwise  the  term  goes  to  minus  infinity 
and  the  pdf  and  cdf  are  undefined. 

7.  Evaluation  of  the  3-LN  cdf  at  point  X'  is  always 
computed  by  evaluating  its  respective  standard  normal  cdf 
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with  appropriate  limits.  The  percentile  points  are  identical. 
A  proof  follows: 

Let  the  transformation  be 


t 


In  (x-g)  -y 

a 


(6.1) 


then 


dt 


1 

a (x-g) 


■dx 


Now  since  the  3-LN  cdf  is  defined  by 


(6.2) 


F  <x’ )  = 


(X-g)  OvTlf 


exp 


-(ln(x-g) -u) 


2a‘ 


dx  g <x '  <°°  (6.3) 


application  of  the  transformation  yields 


F(t') 


dt 


—  0O<’£1  <00 


(6.4) 


F(t' )  is  easily  recognized  as  a  standard  normal  cdf  and 
F  (x* )  =  F(t')  . 

Generation  of  Data 

As  previously  mentioned,  the  objective  of  this 
research  is  parameter  estimation  of  the  3-LN  distribution 
with  emphasis  on  small  samples  sizes.  As  such,  sample  sizes 
of  6,  8,  10,  12,  and  16  are  generated.  For  each  of  these 
sample  sizes,  three  combinations  of  y  (the  mean  of  the  parent 
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normal  distribution)  and  o  (the  standard  deviation  of  the 
parent  normal  distribution)  are  used:  (0.0,  1.0),  (1.0, 

1.0),  and  (1.0,  2.0).  The  location  parameter  is  always 
equal  to  10.0  since  different  values  of  £  only  result  in  a 
translation  along  the  x-axis  and  does  not  affect  the  estima¬ 
tion  of  the  other  two  parameters.  The  combinations  of  sampL 
sizes  and  parameters  result  in  a  total  of  fifteen  different 
cases. 

For  each  case,  1,000  replications  of  sample  size  n 
are  generated  for  use  in  the  analysis.  Caso's  study  of 
Monte  Carlo  validity  recommended  2,000  replications;  however 
he  also  stated  that  1,000  replications  would  be  sufficient 
(5:37).  Since  runs  of  2,000  iterations  are  very  prohibitive 
in  terms  of  computer  CPU  time,  1,000  replications  are  used. 
Generation  of  the  3-LN  deviates  is  accomplished  using  the 
IMSL  routine  GGLNG  and  adding  the  value  of  the  location  para¬ 
meter  to  each  deviate.  GGLNG  generates  2-LN  deviates,  given 
the  values  of  u  and  a  as  input;  the  addition  of  the  location 
parameter  forms  the  3-LN  random  deviates.  Each  sample  is 
then  sorted  in  ascending  order  by  the  IMSL  subroutine  VSRTA. 
With  the  desired  samples  in  hand,  the  next  step  is  to 
calculate  the  interpolative  and  iterative  modified  ML- 


estimators. 


Computerization  of  Maximum 
Likelihood  Estimators 

Interpolative  Maximum  Likelihood  Estimators .  It  was 
mentioned  in  Chapter  IV  that  if  the  location  parameter  is 
known,  then  y  and  a  are  easily  computed  using  Eqs  (4.7)  and 
(4.8)  respectively.  In  this  thesis,  however,  £  is  assumed 
to  be  unknown  and,  therefore,  must  be  estimated.  Unfortunately, 
it  is  very  difficult  to  estimate  £.  As  stated  in  Chapter  IV, 
iterative  procedures  must  be  applied  if  all  three  parameters 
are  to  be  estimated  via  the  ML-method.  It  certainly  would 
be  very  convenient  if  a  simple  method  existed  where  an 
accurate  estimate  of  £  could  be  found;  thus,  with  this 

A  /"V 

estimate,  y  and  a  could  be  easily  computed.  In  searching 
through  the  literature,  one  such  method  was  discovered.  This 
method  involves  calculating  the  median  ranks  (Yif  i  =  1,  2) 
of  the  first  two  order  statistics  (X^j,  X^).  Then  the 
slope  of  the  line  connecting  the  first  two  points  (i.e., 

(Xj^j,  Y^)  ,  (X  ^2)  >  Y2^  ^erive(3  and  interpolated  down  to 
the  x-axis.  The  point  at  which  the  slope  intersects  the 

a  /v 

x-axis  is  taken  as  the  value  for  £.  Now,  with  £  known, 

Eqs  (4.7)  and  (4.8)  are  used  to  obtain  the  estimates  for 
y  and  a . 

This  method  of  linear  interpolation  for  estimation 
of  location  parameters  was  originally  developed  in  a  masters 
thesis  by  Second  Lieutenant  D.  E.  Bertrand  (3) .  He  used  this 
procedure  on  the  four  parameter  Beta  distribution  to  locate 
the  upper  and  lower  location  parameters  and  obtained 
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excellent  results.  This  method  is  extremely  desirable  for 
the  estimation  of  £ since  the  interpolation  will  always  result 
in  a  C  being  less  than  the  first  order  statistic.  Further¬ 
more,  it  appears  intuitively  obvious  that  when  the  pdf  is 
skewed  to  the  left,  as  in  the  3-LN  pdf,  the  interpolation 
approximation  of  location  should  be  very  close  to  its  true 
value.  Although  consistency  of  the  interpolative  estimate 
was  not  proven  by  Bertrand,  it  seems  apparent  that  as  the 
sample  size  increases,  the  first  two  order  statistics  would 
get  closer  to  the  true  value  of  the  location  parameter, 
thereby  making  the  interpolation  approximation  a  better  fit 
to  the  tail  of  the  true  cdf  (3:19). 

The  interpolative  ML-estimator  methodology  is  given 

below: 

1.  Arrange  the  sample  deviates  in  ascending  order. 

2.  Calculate  the  median  ranks  of  the  first  two 
order  statistics  using  the  formula  (3:31): 

1  ■  !’2  «6-5' 

3.  Find  the  slope  (m)  of  the  line  between  (X^j, 
yx)  and  (X(2),  Y2)  by 

m  =  <Y2-Y1)/(X(2)"X(1))  (6.6) 

4.  The  estimate  £  is  the  point  at  which  the  slope 
(m)  intersects  the  x-axis  and  is  given  by 
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(6.7) 


e  = 


-Y^/m 


The  method  is  preser.ted  graphically  in  Figure  3. 

Although  the  estimators  from  this  method  are  not 
"strict"  ML-estimators  in  the  statistical  sense,  they  are 
believed  to  yield  extremely  good  estimates.  Appendix  B 
contains  a  computer  listing  of  the  interpolative  ML-estimator 
algorithm. 

Iterative  Maximum  Likelihood  Estimators .  As  stated 
previously,  the  interpolative  ML-estimators  are  not  "strict" 
ML-estimators  in  the  statistical  sense,  so  an  algorithm 
designed  to  calculate  the  "true"  ML-estimators  was  sought. 

At  first,  the  iterative  ML-estimator  algorithm  developed  by 
Drs  Harter  and  Moore  was  used.  The  purpose  of  their  algorithm 
was  to  study  the  asymptotic  properties  of  ML-estimators  and, 
therefore,  used  sample  sizes  of  50,  100,  and  200.  They  con¬ 
stantly  warn  throughout  their  paper  (10)  that  with  small 
sample  sizes:  (1)  the  likelihood  function  may  have  no  clearly 
defined  local  maximum,  (2)  the  first  order  statistic  might 

A 

have  to  be  used  as  C  to  prevent  the  likelihood  function  from 
going  to  infinity,  and  (3)  the  iteration  procedure  might  be 
slow.  Their  suppositions  were  confirmed.  Using  sample 
sizes  of  12  and  smaller  along  with  different  combinations  of 
the  values  for  y  and  a,  their  algorithm  set  %  equal  to  X^j 
over  99  percent  of  the  time  and  convergence  was  extremely 
slow;  in  some  cases  the  estimates  diverged. 
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These  problems  led  to  the  development  of  a  new 
iterative  ML-estimator  algorithm  designed  to  combat  the 
difficulties  associated  with  the  Harter  and  Moore  algorithm. 
This  new  algorithm  follows  along  the  same  lines  as  the  Hart 
and  Moore  algorithm  except  that  only  y  and  a  are  estimated 
one  at  a  time  in  cyclical  order  while  £  is  equated  to 
The  IMSL  subroutine  ZSCNT  which  solves  a  series  on  nonlinear 
equations  by  a  variation  of  the  secant  method  is  used  to 

A  A 

obtain  u  and  a.  The  ML-equations  for  y  and  a  are  put  in 
separate  ZSCNT  subroutines.  When  y  is  being  estimated,  the 
latest  estimate  of  a  is  held  constant  and  vice-versa  until 
both  estimates  converge.  Initially,  the  ML-equations  were 
combined  in  one  ZSCNT  subroutine  but  convergence  could  not 
reached  to  three  significant  digits  in  1,000  iterations  so 
the  ML-equations  were  put  into  separate  subroutines.  The 
choice  of  initial  estimates  for  input  into  the  ML-equations 
is  crucial.  Good  initial  values,  especially  for  a  is  vital 
to  this  algorithm  if  accurate  estimates  are  desired.  The 
following  initial  values  were  investigated  (£  always  equals 
X(i)) j  (1)  y0  =  sample  mean,  a0  =  sample  standard  deviati 

(2)  y0  =  In  (sample  mean),  a0  =  In  (sample  standard  devia4: 

(3)  y0  =  In  (sample  median) ,  a0  =  a  derived  by  equating 
sample  and  population  skewness  and  solving  for  a.  Results 
from  these  runs  showed  that  a  is  very  sensitive  to  c0  while 

A 

y  is  relatively  insensitive  to  y0.  In  this  thesis,  the  thir 
set  of  initial  values  is  used  as  input  to  the  modified  item 
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ML-estimator  algorithm  since  they  give  the  most  accurate 
estimates. 

Although  this  algorithm  does  not  produce  "strict" 

A 

MLEs  in  a  statistical  sense  (since  £  =  X|^)  the  resulting 
estimators  from  this  procedure  "...  appear  to  possess  most 
of  the  desirable  properties  usually  associated  with  maximum- 
likelihood  estimates  [10:843]."  Since  the  constraint 

A  A  A 

£  =  is  imposed  when  calculating  V  and  a,  the  censored 

(from  below)  ML-equations  must  be  used;  they  are  (10:844): 


3L 

3U 


n 

I 

i=r+l 


r  f(Zr+l) 

nw 
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(6.8) 
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where 


ln(xi-£) -u 
a  ’ 


f(z.)  =  — —  exp  f-z. 2 /2 

1  /TP  L  1 

Zi 

F  (zi)  =  /  f  (t)  dt 


with  n  as  the  sample  size  and  r  as  the  number  of  observations 
censored  from  below. 

The  iterative  methodology  is  presented  below: 

1.  Arrange  sample  deviates  in  ascending  order. 
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2.  Calculate  the  initial  estimates:  yo  =  In  (sample 

median) ,  o0  =  a  derived  by  equating  sample  and  population 

/\ 

skewness,  £  = 

3.  Given  the  initial  estimates  and  £  =  as 

input,  recursively  estimate  u  and  a,  using  censored  ML- 
equations,  until  the  estimates  agree  within  a  certain 
tolerance  or  a  maximum  number  of  iterations  is  reached. 
Appendix  B  contains  a  computer  listing  of  this  algorithm. 
Attention  is  now  turned  to  the  computerization  of  the  MD- 
estimators. 

Computerization  of  Minimum 
Distance  Estimators 

The  main  thrust  of  this  research  is  to  estimate 
the  location  parameter  of  the  3-LN  distribution  by  minimum 
distance  (MD)  estimation  and  then  use  the  estimate  to  obtain 
refined  estimates  of  u  and  a  via  their  ML-equations.  It  is 
anticipated  that  these  refined  estimates  are  more  accurate 
than  the  estimates  from  the  modified  methods  of  maximum 
likelihood.  As  described  in  Chapter  V,  the  MD-estimators 
are  obtained  by  finding  values  for  the  parameters  which 
minimize  the  discrepancy  between  an  empirical  and  an 
estimated  distribution  function.  In  this  thesis,  only  the 
location  parameter  is  minimized,  while  the  other  two  para¬ 
meter  estimates,  determined  from  the  modified  ML-methods, 
are  held  fixed.  The  empirical  distribution  function  is  the 
1/n-  step  function  and  the  estimated  distribution  is  obtained 


*  [ 


by  using  y  and  a  along  with  the  most  recent  MD-estimate  of 
£.  Once  £  is  determined,  it  is  used  in  the  censored  or 

uncensored  (depending  on  whether  or  not  £  =  ML- 

x  * 

equations  of  y  and  a  which  yields  y  and  a.  This  procedure 
is  done  using  the  estimates  from  the  interpolative  and 
iterative  modified  ML-methods.  Three  minimum  distance 
estimators  are  investigated:  the  Kolmogrov  Distance,  the 
Cramer-von  Mises  Statistic,  and  the  Anderson-Darling  Statistic. 

Kolmogrov  Distance  Estimators .  The  computational 
formula  which  allows  for  calculation  of  the  Kolmogrov 
Distance  is  defined  by  Stephens  as  (26:731): 

D+  =  max(l  <  i  £  n)  [(i/n)  - 

D-  =  max(l  <.  i  <  n)  [ z ^  -  ( i— 1 )  /n] 

D  =  max(D+,D”)  (6.9) 

where  z-^  is  the  estimated  standard  normal  cdf  evaluated  at 
the  i*"*1  sample  point.  To  calculate  the  Kolmogrov  Distance, 

£  is  shifted  two  units  to  the  right  and  left  of  itself.  The 

original  shift  is  to  the  left  and  £  is  moved  at  .01  steps. 

With  each  iteration,  the  maximum  distance  between  the 
theoretical  and  empirical  distribution  functions  are  recorded. 
Of  course,  location  estimates  are  not  allowed  to  go  within 
.001  of  Xjjj  since  this  implies  divergence  of  the  estimate. 
Also,  if  the  final  estimate  is  constrained  by  the  highest  or 

A  A  A  ^ 

lowest  incremental  value  (i.e.,  £  =  £+2.0  or  £  =  £-2.0), 
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a  new  initial  estimate,  which  eliminates  this  constraint, 
is  provided  and  the  iterations  begin  anew.  The  final  result 
is  to  choose  the  incremental  value  which  minimizes  the  maximum 
distance  between  the  two  distribution  functions.  The  actual 
recalculation  of  the  new  minimum  distance  location  parameter 
is  calculated  by: 


£  «  £  +  (-2.0  -+  .01  •  COUNT)  (6.10) 

where  COUNT  is  a  counter  which  locates  the  point  at  which  the 
maximum  distance  between  the  estimates  and  empirical  distribu¬ 
tion  functions  is  minimized.  The  Kolmogrov  MD-estimators 

of  y  and  a  are  calculated  by  using  £  in  the  censored/ 

£  £ 

uncensored  ML-equations  of  y  and  a,  thus  yielding  y  and  a. 

Cramer-von  Mises  Estimators.  Next,  the  program 
calculates  tne  Cramer-von  Mises  MD-estimators.  Recall  that 
this  distance  measure  finds  the  discrepancy  between  the 
estimated  and  empirical  distribution  functions.  The  computa¬ 
tional  formula  for  this  measure  is  given  by  (26:731): 

n 

W2  =  E^[Zi  -  (2i-l)/2nj2  +  (l/12n)  (6.11) 

where  z^  is  the  estimated  standard  normal  cdf  evaluated  at 
the  ifc^  sample  point  and  n  is  the  sample  size.  The  minimiza¬ 
tion  of  the  location  parameter  is  accomplished  via  the  IMSL  sub¬ 
routine  ZXMIN.  The  subroutine  uses  a  Quasi-Newton  method  to 
minimize  a  function  of  one  or  more  variables  and  is  identical 
to  the  one  used  by  Parr  in  his  PhD  dissertation.  With  this 
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method,  y  and  a  are  held  constant  while  £  is  being  minimized 
by  ZXMIN.  If  the  last  estimate  of  £  is  within  0.001  of  X^j 
(implying  divergence) ,  a  new  initial  estimate  of  £  is  provided. 
If  the  estimate  of  £  is  again  within  0.001  of  X^,  then 

is  censored  along  with  any  other  sample  points  equal  to 

A 

it  and  £  is  set  to  the  last  censored  observation.  The  CVM 
MD-estimators  for  y  and  a  are  then  calculated  by  using  £  in 
the  uncensored/censored  ML-equations  of  y  and  a  thereby 

a  ^ 

yielding  y  and  c. 

Anderson-Darling  Estimators .  The  final  distance 
measure  investigated  in  this  research  is  the  Anderson-Darling 
Statistic.  This  distance  measure  is  similar  to  the  CVM 
measure  in  that  it  finds  the  discrepancy  between  two  distribu¬ 
tion  functions  but  with  more  emphasis  being  placed  on  the 
tails  of  the  distribution.  Stephens  gives  the  computational 
formula  as  (26:731): 


A2  =  - 


i=l 


(2i-l)ln(zi+ln(l-zn+1_i)/n 


-n 


(6.12) 


Again,  z^  is  the  estimated  standard  normal  cdf  evaluated  at 
the  ith  sample  point  and  n  is  the  sample  size.  The  minimiza¬ 
tion  process  is  identical  to  that  of  the  CVM  distance  calcula- 

£  £  £■ 

tion  just  described.  Again,  once  £  is  obtained,  y  and  a  are 
calculated  via  their  uncensored/censored  ML-equations. 

Appendix  C  contains  a  listing  of  the  computer  program 
used  to  calculate  each  MD-estimator .  The  approximate  CPU 
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time  required  for  one  run  of  1,000  replications  varied  from 
250  to  300  seconds  depending  on  the  sample  size. 


Evaluation  Criteria 

The  final  step  in  this  Monte  Carlo  analysis  is  to 
evaluate  and  compare  all  of  the  estimates  obtained  from  each 
estimation  technique.  This  thesis  uses  two  modified  maximum 
likelihood  techniques  along  with  three  minimum  distance  tech¬ 
niques  for  each  yielding  a  total  of  eight  sets  of  parameter 
estimates  (y,  o,  t)  or  twenty-four  different  parameter  estimates 
for  each  sample.  To  make  any  relevant  conclusions  out  of  all 
the  different  parameters  and  parameter  sets,  certain  criteria 
for  evaluation  are  needed.  Three  approaches  are  used  for 
this  evaluation:  mean  square  error  (MSE) ,  relative  efficiency 
(REFF) ,  and  the  Cramer-von  Mises  Statistic. 

The  MSE  is  a  measure  of  how  close  each  estimated  para¬ 
meter  is  to  its  true  value  and  is  useful  for  investigating 
the  strength  of  each  estimator.  MSEs  are  calculated  by 

1000  „ 

MSE  =  Z  (0.-0) 2/1000 
i=l 

where  9  is  the  true  parameter  value  and  9i  is  the  parameter 
estimate  from  the  ith  sample  and  particular  estimation  tech¬ 
nique.  The  smaller  the  MSE,  the  better  the  estimator.  The 
REFF  is  another  tool,  similar  to  MSE,  which  aids  in  the  compari¬ 
sons  of  individual  estimators.  Calculation  of  REFFs  are 
accomplished  by: 


5 


REFF 


MSE 

m 


/MSEi 


where  MSE  is  the  mean  square  error  of  a  reference  estimator 
m 

and  MSE^  is  the  mean  square  error  of  the  estimator  being 

tested.  For  this  analysis,  MSE  is  taken  as  the  estimators 

m 

from  the  interpolative  modified  ML-method.  Values  of  REFF 
greater  than  unity  imply  that  the  estimator  is  performing 
better  than  the  interpolative  modified  maximum  likelihood 
estimator.  The  third  evaluator  is  a  distance  measure  which 
provides  an  overall  measure  of  how  well  the  estimated 
distribution  fits  the  true  distribution.  An  appropriate 
measure  of  this  type  is  the  Cramer-von  Mises  statistic, 
defined  by  Eq  (5.2).  In  this  application,  however,  Gn  is 
replaced  by  #-  the  estimated  cdf  (see  Figure  4) .  The  integral 
must  then  be  multiplied  by  the  sample  size  to  form  the  actual 
distance  measure  (3:32).  Since  3F  (x)  =  -j^-dx  =  f(x)dx  , 
the  formula  used  to  compute  the  distance  between  the 

/v 

estimated  cdf,  F,  and  true  cdf,  F,  is: 


W2  (F,F) 


n  /  F (x: 0) -F (x: 0) 

CL 


2 

f  (x:0)dx 


(6.13) 


where  0  represents  the  parameter  estimates  and  0  represents 
the  true  parameter  values.  The  limits  of  the  integral  are 
from  C  to  »  as  the  3-LN  distribution  is  not  defined  outside 
this  range.  This  integral  is  evaluated  using  15-point 
Laguerre  integration.  A  function  g(x)  with  limits  ranging 
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from  0  to  00  may  be  numerically  integrated  by  Laguerre  integra¬ 
tion  using  the  relationship  (1:923): 

00  n 

f  g(x)dx  I  W.e  g(x.)  (6.14) 

0  i=l  1  1 

where  and  x^  are  the  ifc^  weights  and  abscissa  of  the 
Laguerre  polynomials  and  n  is  the  number  of  quadrature  points — 
in  this  case  n  =  15  .  Before  applying  Laguerre  integration, 

however,  the  lower  limit  is  increased  to  £  by  adding  the 
true  value  of  Z,  to  each  abcissa.  The  weights  and  abcissa 
are  taken  from  the  Handbook  of  Mathematical  Functions  with 
Formulas,  Graphs  and  Mathematical  Tables  (1) . 

The  CVM  statistic  just  described  is  calculated  for 
each  of  the  1,000  replications  of  a  given  sample  size  and 
estimation  technique.  The  sample  mean  (MCVM)  and  sample 
standard  deviation  (SDCVM)  are  calculated  by: 

1000 

MCVM  =  Z  W.2/1000 
i-1  1 

/Tooo  I 

SDCVM  =  /  Z  (VI.2-  MCVM)2/1000 
i=l  1 

where  W^2  is  the  CVM  distance  of  the  ifc^  replication.  MCVM 
and  SDCVM  are  the  overall  measures  of  how  well  the  estimated 
distribution  fits  the  true  distribution.  The  smaller  the 
MCVM  and  SDCVM  the  better  the  fit. 
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Much  can  be  learned  from  investigating  the  three 


evaluation  criteria  just  described.  The  MSEs  are  extremely 
useful  in  determining  which  estimation  method  worked  best  on 
a  particular  parameter.  Unfortunately,  however,  the  MSEs  are 
not  scale  invariant;  the  same  size  MSE  may  be  highly  signifi¬ 
cant  for  a  small  valued  parameter  but  insignificant  for  a 
larger  one  (2:32).  The  relative  efficiencies  yield  approximately 
the  same  type  of  information  about  a  particular  parameter 
except  that  a  base  estimator  is  used  which  facilitates 
comparisons  across  all  estimators.  The  CVM  distance  has  the 
advantages  over  the  MSE  and  REFF  in  that  it  is  a  single  measure 
of  fit  for  all  three  parameters  and  is  also  scale  invariant 
with  respect  to  the  magnitude  of  the  parameter  estimates. 
Unfortunately,  information  concerning  the  individual  parameters 
is  not  provided.  A  closing  comment  on  the  CVM  distance 
measure  is  that  no  bias  is  introduced  by  using  the  CVM 
statistic,  both  for  the  MD-method  and  as  an  evaluation 
criteria.  This  should  be  apparent  since  during  minimization 
the  discrepancy  between  the  estimated  distribution  function 
and  1/n  step  function  is  calculated,  whereas  during  evaluation 
the  discrepancy  between  the  true  and  estimated  cdfs  is  found. 
Investigation  of  these  three  criteria  provides  more  than 
enough  information  to  draw  valid  inferences  about  each  modi¬ 
fied  maximum  likelihood  and  minimum  distance  estimator.  A 
computer  listing  of  the  program  used  to  compute  these  three 
evaluation  criteria  is  given  in  Appendix  D. 
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VII.  Results  and  Conclusions 


Results  and  Comparisons 

Numerical  results  pertaining  to  each  of  the  estima¬ 
tion  methods  are  listed  in  Appendix  A.  For  each  of  the 
fifteen  different  cases  (i.e.,  combination  of  parameter 
set  (y,cr,£;)  and  sample  size),  there  corresponds  a  page  in 
Appendix  A  containing  a  table  of  the  mean  square  errors 
(MSEs)  relative  efficiencies  (REFFs)  and  the  means  (MCVMs) 
and  standard  deviations  (SDCVMs)  of  the  Cramer-von  Mises 
statistics.  The  eight  different  estimation  methods: 
modified  interpolative  maximum-likelihood  (INT)  estimation, 
modified  iterative  maximum-likelihood  (ITR)  estimation,  and 
combinations  of  the  INT  and  ITR  estimates  with  Kolmogrov 
minimum  distance  (KOL)  estimation,  Cramer-von  Mises  minimum 
distance  (CVM)  estimation,  and  Anderson-Darling  minimum 
distance  (A-D)  estimation  are  listed  down  the  side  of  each 
table.  The  minimum  distance  estimators  obtained  by  the  INT 
estimates  are  listed  directly  under  INT,  and  the  minimum 
distance  estimators  obtained  by  the  ITR  estimates  are  listed 
directly  under  ITR.  At  the  top  of  each  table  the  sample 
size  and  true  parameter  values  are  given.  In  the  following 
row,  "MU,"  "SIG , "  and  "XI"  denote  the  parameters  y,  a,  and 
£  respectively.  The  entries  in  the  "DIV"  column  are  the 
number  of  times  the  location  parameter  estimate  is  equated  to 
the  last  censored  observation,  usually  the  first  order 
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statistic.  In  the  "CVM  statistics"  table,  "MCVM"  is  the 
mean  of  the  Cramer-von  Mises  statistic  and  "SDCVM"  is  the 
estimate  of  the  standard  deviation.  When  reference  is  made 
about  a  specific  combination  of  a  modified  maximum  likelihood 
estimates  with  a  minimum  distance  estimator,  the  notation 
ML/MD  is  used  where  ML  is  either  INT  or  ITR,  and  MD  is 
either  KOL,  CVM,  or  A-D  depending  on  the  estimation  technique 
applied.  It  should  be  noted  that  in  some  cases,  the  MSEs 
and  the  MCVM  appear  to  contradict  each  other.  In  other  words, 
one  method  may  have  larger  MSEs  for  each  parameter  but  a 
smaller  overall  MCVM  when  compared  to  another  method.  This 
situation  arises  in  the  case  of  N=10  ,  y  =  0.0  ,  a  =  1.0  , 
£  =  10.0  ;  ITR/KOL  has  larger  MSEs  for  each  parameter  but 
a  smaller  overall  MCVM  than  the  INT/CVM  estimates.  This 
apparent  dichotomy  occurs  because  a  set  of  values  further 
away  from  the  true  values  in  a  MSE  sense  may  be  closer  in  a 
distance  sense.  Results  obtained  from  the  MSE,  REFF,  and 
CVM  Statistics  tables  are  summa.  ized  in  the  following  para¬ 
graphs. 

As  previously  stated,  the  MSE  provides  a  measure  of 
how  close  each  estimated  parameter  is  to  its  true  value,  while 
the  REFF  provides  for  comparisons  across  all  estimators  by 
the  use  of  a  base  estimator.  The  MSEs  from  all  estimation 
methods  considered  in  this  thesis  are  extremely  good  in  that, 
except  for  cases  where  N  =  6  ,  no  MSE  of  any  parameter  is 

ever  greater  than  unity  1  In  support  of  the  asymptotic  ML 
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properties,  the  MSEs  of  each,  parameter  decrease  as  the  sample 
size  increases  for  the  INT  and  ITR  estimators  with  the 
exception  of  the  location  estimator  via  ITR  estimation. 

This  exception  seems  reasonable  since  the  location  estimate 
is  equated  to  the  first  order  statistic  in  this  method.  Also, 
the  decreasing  MSEs  of  the  interpolation  estimate  for  the 
location  parameter  as  the  sample  size  increases  lends  strong 
support  for  the  consistency  assumption  of  the  interpolative 
location  estimator.  When  the  term  "best"  or  "better"  is 
mentioned  in  the  following  text,  it  refers  to  best  or  better 
in  the  MSE  sense.  It  should  also  be  remembered  that  in  every 
case  C  =  10.0  .  When  looking  at  a  particular  sample  size, 

the  MSEs  for  all  methods  have  a  tendency  to  increase  as  the 
parameter  values  increase;  however,  this  is  not  unexpected 
since  the  MSE  is  not  scale  invariant.  In  comparing  the  INT 
and  ITR  estimators,  the  REFFs  reveal  that,  in  each  case,  the 
INT  estimator  yields  better  estimates  than  the  ITR  estimators 
except  for  the  location  parameter  when  y  =  1  ,  a  =  1 

The  REFFs  also  show  that  the  INT/CVM  and  INT/A-D  estimators  are 
better  than  their  respective  ITR/CVM  and  ITR/A-D  estimators 
for  cases  of  y  »  0  ,  a  =  1  and  y  =  1  ,  a  =  1  ;  in 

cases  of  y=l  ,  a  =  2  ,  the  reverse  is  true.  It  is  also 

interesting  to  note  which  estimators  perform  the  best  for 
each  parameter  across  all  possible  cases. 

For  cases  of  y  =  0.0  ,  a=1.0  ,  the  best 

estimator  for  y  is  the  ITR/KOL  estimator  when  N  =  6,8,10  ; 
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for  N  =  12,16  the  best  estimators  for  y  are  the  CVM  and 
A-D  estimators  using  the  INT  or  ITR  estimates.  The  best 
estimator  for  a  is  the  INT/A-D  estimator  except  when  N  =  16 
where  the  INT/CVM  estimator  is  best.  The  best  estimator 
for  £;  is  INT/A-D  estimator  except  for  N  =  6  in  which  case 
the  INT/CVM  estimator  is  best. 

For  cases  of  y  =  1.0  ,  a  =  1.0  ,  the  best 

estimator  of  each  parameter  across  all  sample  sizes  is  the 
ITR/KOL  estimator. 

For  cases  of  y  =  1.0  ,  a  =  2.0  ,  the  best 
estimator  for  y  when  N  =  6,8,10  is  the  ITR/KOL  estimator; 
however,  when  N  =  12,16  the  best  estimator  varies  between 
the  combinations  of  INT  and  ITR  estimates  with  the  CVM  and 
A-D  methods.  The  best  estimator  of  a  is  the  ITR/CVM  estimator 
followed  closely  by  the  ITR/A-D,  INT/CVM,  and  INT/A-D 
estimators.  The  best  estimator  of  £  varies  between  the  ITR/CVM 
and  ITR/A-D  estimators;  the  INT/CVM  and  INT/A-D  estimators 
are  much  worse. 

The  MCVM  measure  also  reveals  some  important  informa¬ 
tion  regarding  the  estimators.  Results  from  this  measure  are 
usually  interesting  since  it  provides  an  overall  measure  of 
how  well  the  estimated  distribution  function  fits  the  true 
distribution  function.  The  term  "best"  or  "better”  is  now 
changed  to  mean  best  or  better  in  the  minimum  distance  sense. 
When  the  sample  sizes  are  fixed  and  the  parameter  sets  are 
investigated  in  the  order  (y  =  0.0  ,  a  =  1.0), 

(y  =  1.0  ,  a  =  1.0),  and  (y  =  1.0  ,  c  =  2.0),  the  results 


62 


show  that  (1)  the  MCVM  for  the  INT  estimator  gets  larger, 

(2)  the  MCVM  for  the  ITR  estimator  gets  smaller,  and  (3) 
the  MCVM  for  the  MD-estimators  are  always  smaller  than 
either  the  INT  or  ITR  estimators,  except  for  one  case  where 
N  =  16  ,  and  remain  fairly  constant.  This  indicates  that 

in  choosing  between  the  INT  and  ITR  estimators,  the  INT 
estimator  should  be  used  for  values  of  y  and  a  less  than 
unity,  whereas  for  values  of  p  and  a  greater  than  unity,  the 
ITR  estimator  is  the  correct  choice. 

For  cases  of  y  =  0.0  ,  a  =  1.0  ,  the  best  over¬ 

all  estimator  is  the  ITR/KOL  estimator  except  for  N  =  16 
where  the  INT/CVM  estimator  is  best. 

For  cases  of  y  =  1.0  ,  a  =  1.0  ,  the  best  estimator 

is  the  ITR/KOL  estimator  except  for  N  =  16  in  which  case 
the  INT/A-D  estimator  is  best. 

For  cases  of  y  =  1.0  ,  a  =  2.0  ,  the  best  estimator 

for  N  =  6,8,10  is  the  ITR/CVM  estimator;  for  N  =  12  is 
the  ITR/KOL  estimator;  and  for  N  =  16  is  the  INT/CVM 
estimator.  The  estimators  are  closely  followed  by  other 
combinations  of  INT  and  ITR  estimates  with  the  CVM  and  A-D 
methods . 

The  MCVM  results  clearly  show  that  the  minimum  dis¬ 
tance  estimators  using  the  modified  maximum  likelihood 
estimates  are  far  more  superior  than  the  modified  maximum 
likelihood  estimators.  The  dominant  estimator  appears  to  be 
the  ITR/KOL  estimator  followed  closely  by  the  ITR/CVM, 
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ITR/A-D,  INT/CVM,  and  INT/A-D  estimators.  As  a  final  note, 
the  SDCVM  value  for  each  estimator  is  extremely  small 
indicating  the  stability  of  each  estimator. 

Conclusions 

The  objective  of  this  thesis  is  the  comparison  of 
ML-estimators  against  MD-estimators  given  small  samples  of 
3-LN  deviates.  It  was  anticipated  that  the  MD-estimators 
would  yield  much  more  accurate  estimates  than  those  from 
the  MD-estimators.  Due  to  the  small  sample  sizes,  "strict" 
ML-estimators  could  not  be  obtained  so  two  modified  ML- 
estimation  techniques  are  developed  and  compared  against 
three  MD-estimation  methods.  The  results  of  the  Monte  Carlo 
analysis  clearly  demonstrates  the  superiority  of  the  MD- 
estimators  over  both  modified  ML-estimators. 

Considering  the  eight  estimators  tested  (given  the 
ranges  of  parameter  values  and  sample  sizes)  in  the  Monte 
Carlo  analysis,  several  conclusions  are  made.  The  MD- 
estimators  are  always  superior  to  both  modified  ML-estimators. 
Of  the  modified  ML-estimators,  the  interpolative  method  gives 
the  best  estimates.  Of  the  MD-estimators,  it  appears  that 
the  ITR/KOL  estimator  gives  the  best  estimates,  but  the 
INT/KOL  estimator  gives  the  worst  estimates.  The  result  may 
be  partly  due  to  the  location  parameter  never  being  set  to 
the  last  censored  order  statistic  by  the  ITR/KOL  estimator. 

The  CVM  and  A-D  estimators  yield  approximately  equivalent 
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estimates  regardless  of  whether  the  INT  or  ITR  estimates  are 
used.  This  fact  lends  further  support  to  the  robustness 
property  of  the  MD-estimators . 

Although  the  estimators  are  rated  as  the  "best"  or 
"worst”  estimators  in  the  above  paragraphs,  it  should  be 
noted  that  no  estimator  gives  extremely  poor  estimates.  For 
the  MD-estimators,  the  MSEs  are  rarely  greater  than  0.5  and 
the  MCVM  are  usually  less  than  0.2.  For  the  modified  ML- 
estimators,  the  greatest  MSE  is  1.360707  while  the  MCVM  is 
usually  much  less  than  0.5.  Furthermore,  if  great  accuracy 
of  the  estimates  is  not  required,  or  if  a  computer  is  not 
available,  then  the  interpolative  ML-method  becomes  a  very 
viable  estimation  technique  since  the  estimates  can  be 
easily  computed  using  a  desk  or  hand-held  calculator;  all 
other  estimation  methods  require  computerization  due  to  more 
involved  computations. 

Recommendations  for  Further  Study 

Several  possibilities  exist  for  further  research  in 
parameter  estimation  techniques  for  the  3-LN  distribution. 
First  and  foremost  would  be  the  development  of  an  iterative 
estimation  methodology  allowing  for  the  solution  of  the  ML- 
equations  for  each  parameter  when  working  with  small  sample 
sizes.  As  it  stands  now,  the  MLE  of  the  location  parameter 
is  equated  to  the  first  order  statistic,  while  y  and  a  are 
iteratively  estimated.  Perhaps  using  the  interpolation 
estimate  of  £  as  an  initial  guess,  an  iterative  procedure 


could  be  developed.  In  this  procedure,  £  could  be  shifted 

A 

around  itself  in  small  step  lengths  (i.e.,  0.001)  with  y  and 
a  being  calculated  at  each  step.  At  each  step  the  likelihood 
function  would  also  be  calculated  to  determine  the  direction 
of  the  next  step  for  the  location  estimate.  In  this  way, 
the  location  estimates  are  independent  of  the  steps  and  may 
yield  strict  ML  estimators. 

In  this  thesis,  only  the  location  parameter  is 
refined  by  the  MD-methods.  The  remaining  two  parameters 
could  be  similarly  estimated.  An  iterative  procedure  could 
be  developed  where  all  three  parameters  are  found  via  MD- 
methods.  For  example,  a  could  be  estimated  using  Kupier's 
Maximal  Interval  Probability  Statistic  and/or  Watson's 
Statistic,  while  y  and  £  are  estimated  by  the  Kolmogrov 
Distance,  Cramer-von  Mises  Statistic  and/or  the  Anderson- 
Darling  Statistic.  It  is  anticipated  that  this  procedure 
would  provide  much  more  accurate  estimators  than  those 
obtained  in  this  thesis.  As  such,  this  topic  would  be  an 
excellent  research  area  for  an  adventurous  statistician. 
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Appendix  A 


Tables  of  Mean  Square  Errors, 
Relative  Efficiencies  and 
CVM  Statistics 


Notation 

MU  . Estimate  of  y 

SIG  . Estimate  of  o 

XI  . Estimate  of  £ 

DIV  . Number  of  Times  £  = 

INT  .  Interpolative  ML-Method 

KOL  .  Kolmogrov  MD-Method  Using  INT  Estimates 

CVM  .  CVM  MD-Method  Using  INT  Estimates 

A-D  .  A-D  MD-Method  Using  INT  Estimates 

ITR  .  Iterative  ML-Method 

KOL  .  Kolmogrov  MD-Method  Using  ITR  Estimates 

CVM  .  CVM  MD-Method  Using  ITR  Estimates 

A-D  .  A-D  MD-Method  Using  ITR  Estimates 

MCVM  . Mean  of  the  CVM  Distance 

SDCVM  .  Standard  Deviation  of  CVM  Distance 
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TABLE  A. 1 


SAMPLE  SIZE  6 
TRUE  MU  0 . 00 
TRUE  SIGMA  1.00 
TRUE  XI  10.00 


MU 

SIG 

MEAN  SQUARE  ERRORS 

XI 

DIV 

INT 

.655310 

.286817 

. 105316 

KOL 

1 .360707 

1 .103962 

.143489 

3 

COM 

.547679 

.178744 

.098908 

14 

A-D 

.546901 

.195547 

.  100976 

5 

ITR 

.844672 

.477264 

.493105 

KOL 

.327784 

.244997 

.346987 

0 

COM 

.684417 

.292270 

.127459 

181 

A-D 

.669653 

.279103 

RELATIVE  EFFICIENCIES 

. 117645 

280 

INT 

1.000000 

1.000000 

1.000000 

KOL 

.481575 

.259807 

.733966 

COM 

1.192124 

1.604626 

1.064771 

A-D 

1.198223 

1.466725 

1.042988 

ITR 

.775797 

.576791 

.213578 

KOL 

1.907085 

1.170695 

.303516 

CVM 

.957472 

.981278 

■>  826280 

A-D 

.978581 

1.027640 

.880239 

CVM  STATISTICS 


MCVM 

SDCVM 

int  : 

.366810 

.000491 

KOL  : 

.188285 

.000189 

cvm  : 

.055743 

.000056 

a-d  : 

.058678 

.000059 

itr  : 

.707892 

.000710 

KOL  : 

.030761 

.000031 

cvm  : 

.062547 

.000063 

a-d  : 

.062547 

.000063 

) 
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TABLE  A. 2 


SAMPLE  SIZE  6 
TRUE  MU  1.00 
TRUE  SIGMA  1.00 
TRUE  XI  10.00 


Ml  J 

sic; 

XI 

DIO 

MEAN  SOU ARE  ERRORS 

I  NT 

. 65576? 

.207204 

. 779412 

KOL 

1 . 332993 

1 . 030375 

. 9 4 33 05 

0 

COM 

.355340 

.18024? 

.727961 

0 

a-h 

. 555o60 

.190898 

.745718 

0 

ITR 

.864145 

. 52561 5 

. 359428 

KOL 

.084419 

.071938 

.  564376 

0 

COM 

.753766 

.385351 

.926223 

67 

A-Li 

.686059 

.286989 

.834891 

184 

RELATIOE  EFFICIENCIES 

I  NT 

1.000000 

1.000000 

1.000000 

KOL 

.491952 

.276590 

.026256 

COM 

1.180826 

1.593375 

1.070677 

A-P 

1.180162 

1.443979 

1.045183 

ITR 

.758865 

.546415 

2.168476 

KOL  : 

7.768037 

3.992364 

2.139032 

COM 

.869990 

.745306 

.841495 

A-D  J 

.955849 

1.000751 

.933549 

COM  STATISTICS 


MCVM 

SPCVM 

int  : 

.482803 

.003600 

KOL  : 

.146933 

.000032 

cvm  : 

.056385 

.000021 

a-d  : 

.059234 

.000025 

itr  : 

.342977 

.000025 

KOL  : 

.037457 

.000041 

cvm  : 

.062233 

.000009 

A-P  ♦. 

.062650 

.000022 
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TABLE  A. 3 


SAMPLE  SIZE  6 

TRUE  MU  1.00 

TRUE  SIGMA  2.00 

TRUE  XI  10.00 

MU 

SIG 

XI 

MEAN  SOU ARE  ERRORS 

.943638 

.521600 

1.035676 

1.357249 

.742137 

.982729 

.088670 

.503325 

1.200742 

.893750 

.526195 

1 . 259358 

.964483 

.628586 

♦829922 

.442460 

.670193 

.689556 

.861390 

.379846 

.545708 

.859002 

.470960 

.596845 

RELATIVE  EFFICIENCIES 

1.000000 

1.000000 

1.000000 

.695258 

.702834 

1.053878 

1.061844 

1.036308 

.862530 

1.055810 

.991267 

.822384 

.978387 

.829798 

1.247920 

2.132709 

.778283 

1 .501946 

1.095472 

1.373186 

1.897579 

1.098528 

1.107523 

1.735251 

CVM  STATISTICS 


MCVM 

.617581 

.118615 

.059271 

.063324 

.272654 

.060480 

.057036 

.058148 


SDCVM 

.021080 

.000612 

.000850 

.000845 

.000792 

.000959 

.000737 

.000770 


TABLE  A. 4 


SAMPLE  SIZE  0 
TRUE  M(J  0.00 
TRUE  SIGMA  1.00 
TRUE  XI  10.00 


MU 

BIG 

XI 

MEAN  SQUARE  ERRORS 

int  : 
KOI. 
COM 
A-n 

,476877 
.929417 
. 40 6065 
. 3934 64 

.272087 
. 939559 
.176720 
.  166579 

.071213 
.  1 006  J.  4 
.070241 
.067290 

ITR 

KOL 

COM 

A--n 

.618789 
.373430 
.  501 1 02 
.492213 

. 4 1 6204 
.234557 
.253267 
.249537 

.549261 

.405017 

,084231 

.084408 

RELATIVE  EFFICIENCIES 

INT 

KOL 

COM 

A-D 

1 .000000 
.513093 
1.174307 
1.211999 

1,000000 

.290442 

1 .544111 

1.658130 

1.000000 
,707780 
1.013837 
1 . 058307 

ITR 

KOL 

COM 

A-D 

.770662 

1.277021 

.951505 

.968843 

.655657 

1,163414 

1.077469 

1.093573 

.129653 

.175828 

,845454 

.843676 

COM  STATISTICS 

DIO 


7 

12 

rr 


o 

2  AS 
290 


int  : 

KOL  : 
com  : 

A-n  : 


itr  : 

KOI.  : 

com  : 
a-d  : 


MCOM 

.488003 

,171784 

.040006 

.038105 

.952907 

,020390 

.041624 

.041624 


SDCOM 

.000641 

,000172 

.000040 

.000038 

.000953 

,000020 

.000042 

,000042 
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TABLE  A. 5 


SAMPLE  SIZE  8 
TRUE  MU  1.00 
TRUE  SIGMA  1.00 
TRUE  XI  10.00 


MU 

SIG 

XI 

DIU 

MEAN  SQUARE  ERRORS 

I  NT 

.476934 

.273282 

.526903 

KOL 

.953368 

.935649 

.668776 

1 

CVM 

♦426717 

.191437 

.523154 

3 

A -Li 

.393836 

.172325 

.497591 

2 

ITR 

.626356 

.431749 

.282776 

KOL 

.071923 

.063611 

.223272 

0 

CUM 

.539125 

.352430 

.635454 

92 

A-D 

.499790 

.255373 

.593109 

226 

RELAT1UE  EFFICIENCIES 

INT 

1.000000 

1.000000 

1.000000 

KOL 

.500263 

.292077 

.787861 

CUM 

1 .117634 

1.427528 

1.007167 

A-D 

1.195815 

1 .581250 

1.050903 

ITR 

.761443 

.632964 

1.863320 

KOL 

6.631136 

4.296162 

2.308229 

CUM 

.884645 

.775310 

.329176 

A-D 

.954254 

1.060013 

.830374 

CUM  STATISTICS 


MCUM 

SDCUM 

INT 

.639525 

.004718 

KOL 

.121675 

.000079 

CUM 

.039762 

.000046 

A-D 

.037358 

.000003 

ITR 

,386990 

.000008 

KOL 

.020569 

.000016 

CUM 

.169935 

.000030 

A-D 

.040439 

.000004 
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TABLE  A. 6 


SAMPLE  SIZE  8 
TRUE.  MU  1.00 
TRUE  SIGMA  2.00 
TRUE  XI  10.00 


MU 

SIG 

XI 

MEAN  SQUARE  ERRORS 

TNT  : 
KOL.  : 

com  : 
a-d  : 

.6/6206 
.92 4357 
.AC7344 
,639233 

.413293 
>  5879  J.  3 
.365302 
.396803 

.283798 
.269736 
. 365726 
.333915 

itr  : 

KOL  : 

com  : 
a-d  : 

.692689 

.440742 

.621100 

,623585 

.480513 

,614440 

.305793 

.366783 

.660948 

.407359 

.185932 

.176682 

RELATION  EFFICIENCIES 

int  : 

KOL  : 

com  : 
a-d  : 

1.000000 

,73.1628 

1.061100 

1 . 057955 

1.000000 
.7029 77 
1.131126 
1.041556 

1.000000 

1.052133 

.775985 

.849910 

itr  : 

KOL  : 

com  : 
a-d  : 

.976319 

1.534424 

1.083851 

1.084512 

,860108 

.672634 

1,351547 

1 .126805 

.429380 

.582318 

1.526353 

1.606268 

COM  STATISTICS 

MCOM 

SDCOM 

INT  : 

.816757 

.028034 

KOL  t 

,096663 

,000887 

COM  : 

.035378 

.000723 

A-D  : 

.036698 

.000713 

itr  : 

.291.249 

.000808 

KOL  : 

.041684 

.000747 

com  : 

.035354 

.000707 

a-d  : 

.035622 

.000718 

♦  ♦ 


DIO 


0 
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TABLE  A. 7 


SAMPLE  SIZE  10 
TRUE  MU  0.00 
TRUE  STOMA  1.00 
TRUE  XI  10.00 


MU 

SIR 

XI 

MEAN  SQUARE  ERRORS 

INI 

.303317 

.250400 

.031575 

KUL 

.601101 

.772202 

.075254 

COM 

. 335937 

.105933 

.050492 

A-D 

.313620 

.157262 

.047732 

ITR 

.496913 

.379719 

.590799 

KOI... 

. 402003 

.227906 

.461629 

COM 

.403179 

.244989 

.064706 

A-D 

.394051 

.241653 

.062153 

DIO 


3 

10 

9 


0 

29  3 
277 


RELATIVE  EFFICIENCIES 


I  NT 

1.000000 

1.000000 

1 

.000000 

KOL 

.562790 

.324371 

.685340 

COM 

1 .140972 

1.347152 

1 

.021448 

A-D 

1  .214480 

1 .592757 

1 

.079382 

ITR 

.771396 

.659646 

.087298 

KOL 

.953513 

1 .098664 

.111725 

COM 

.950736 

1.0224J4 

.797073 

A-D 

.970790 

1.036528 

.829809 

COM  STATISTICS 


MCVM 

SDCVM 

I  NT  : 

.612318 

.000539 

KOL  : 

.276212 

.000003 

cum  : 

.126936 

.000000 

a-d  : 

.130615 

,000000 

itr  : 

1.565242 

.000001 

KOL  : 

.051737 

.000000 

cvm  : 

.168352 

,000001 

a-d  : 

.168352 

.000001 
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TABLE  A. 8 


SAMPLE  SIZE  10 
TRUE  MU  1.00 
TRUE  SIGMA  1.00 
TRUE  XI  10.00 


MLJ 

SIS 

MEAN  SQUARE  ERRORS 

XI 

DIO 

I  NT 

.383927 

.250430 

.381625 

KOL 

. 750273 

.891645 

.503733 

0 

CUM 

.  363375 

.220912 

.381512 

0 

A-D 

.31796 5 

. 157337 

.352664 

0 

ITR 

.'5051b! 

.391544 

.261686 

KOL 

.061069 

.057130 

.178601 

0 

CVM 

.417473 

.300833 

.474617 

125 

A-D 

.402501 

.246345 

EEI  ATI VE  EFFICIENCIES 

.452670 

245 

INT 

1.000000 

1.000000 

1.000000 

KOL 

.511716 

.230363 

.757586 

COM 

1.055108 

1.133620 

1.000296 

A-D 

1.207451 

1.591178 

1.032120 

ITR 

.760024 

.639597 

1.45832? 

MIL 

6.286779 

4.333520 

2.136742 

COM 

.919647 

.832456 

.804063 

A-D 

.953354 

1.016536 

.843054 

COM  STATISTICS 


MCOM 

SDCOM 

INT 

.806153 

.006031 

KOL 

.226466 

,0002.32 

COM 

.124742 

.000030 

A-D 

.127688 

.000021 

ITR 

.749610 

.000051 

KOL 

.086908 

.000019 

COM 

.159530 

. 000035 

A-D 

.159530 

.000035 
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TABLE  A. 9 


SAMPLE  SIZE  10 
TRUE  MU  1.00 
TRUE  SIGMA  2.00 
TRUE  XI  10.00 

MU  SIG  XI  LUU 
MEAN  SQUARE  ERRORS 


INI 

.532332 

.310910 

,117524 

KOL 

.683067 

. 472942 

.  1 12787 

3 

COM 

. 502089 

. 272678 

. 152260 

6 

A~D 

.497984 

.292731 

.  1.33133 

29 

ITE 

.547307 

,374777 

.684603 

kol 

.424192 

.574206 

,517111 

0 

CUM 

.407294 

.235218 

.085660 

71 

A-0 

.505876 

.277305 

,082215 

59 

RELATIUE  EFFICIENCIES 

I  NT 

1.000000 

1.000000 

1 . 000000 

KOL 

.770415 

,657395 

1.041999 

CUM 

1.060234 

1 .140207 

.771363 

A-D 

1.068974 

1 .062102 

.882425 

ITR 

.972640 

.829585 

.171666 

KOL 

1.254932 

.541461 

.227270 

CUM 

1.092424 

1.321796 

1.371976 

A~D 

1.052298 

1.121184 

1.429475 

CUM 

STATISTICS 

MCVM 

SDCVM 

INT 

1.035208 

.034789 

KOL 

.194634 

.000954 

CUM 

.112736 

,000801 

A-D 

.123434 

,000811 

ITR 

.581777 

.000956 

KOL 

.079166 

.000870 

CUM 

.112724 

,000805 

A-D  t  .112901 

.000813 
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TABLE  A. 10 


SAMPLE  SIZE  12 

TRUE  MU  0.00 

TRUE  SI  DMA  1.00 

TRUE  XI  10.00 

SI  (3 

MEAN  SQUARE  ERRORS 


I  NT 

.326100 

. 239326 

.043434 

KOI. 

.  5 *166^ *1 

.655909 

.060628 

COM 

.270527 

. 184550 

.043426 

A-D 

. 265766 

, 140 VVo 

.040502 

ITR 

.417743 

.334823 

.622391 

KOL 

.42372? 

. 226  359 

.501794 

COM 

, 343548 

. 226557 

.052435 

A-D 

.337610 

.220576 

.051215 

RELAX I OE  EFF I C I ENC I E S 

I  NT 

1.000000 

1.000000  1 

.000000 

KOL 

.596560 

♦364877 

.7.16413 

COM 

1 .122444 

1.296811  1 

.000202 

A-D 

1.227020 

1,697397  1 

.072407 

ITR 

.780250 

.714785 

.069786 

KOL 

.769595 

1.057287 

.086558 

COM 

.949211 

1.056364 

.828350 

A-D 

.965908 

1.085007 

.848074 

COM  STATISTICS 


MCVM 

SDCVM 

I  NT  : 

.735408 

.000703 

KOL  : 

.210019 

.000004 

com  : 

,064878 

.000000 

a-d  : 

.065139 

.000000 

ITR  J 

1.624189 

.000000 

KOL  : 

.023369 

,000000 

com  : 

.091758 

,000000 

a-d  : 

.091958 

.000000 
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cn  c«i  o- 


TABLE  A. 11 


sample:  size  12 

TRUE  MU  1.00 
TRUE  SIGMA  1.00 
TRUE  XI  10.00 


MIJ  SIG  XI  DJV 

MEAN  SQUARE  ERRORS 


INI 

.326100 

.239326 

. 320933 

KOL 

•  620869 

.307474 

.421894 

0 

COM 

. 323856 

.246831 

.331687 

3 

A-D 

.269186 

•  147453 

.297023 

n 

ITR 

.427455 

. 358289 

.272780 

KOL 

.062410 

.056860 

.  169423 

0 

COM 

.355350 

.272643 

.384187 

169 

A-D 

.347895 

.236927 

.374251 

287 

RELATIOE  EFFICIENCIES 

INT 

1.000000 

1,000000 

1.000000 

KOL 

.525231 

.296389 

.760697 

COM 

1.006930 

.969595 

,967577 

A-D 

1.211430 

1.623070 

1.080500 

ITR 

.762887 

.667970 

1.176527 

KOL 

5.225145 

4.209080 

1 .094276 

COM 

.917687 

.877801 

.835358 

A-D 

.937351 

1.010128 

.857536 

CUM  STATISTICS 


MCUM 

SDCVM 

INT  ; 

.975852 

.007515 

KOL  : 

.150207 

.000098 

cvm  : 

.063786 

.000029 

a-d  : 

.063633 

.000022 

itr  : 

.657586 

. 000051 

KOL  : 

♦036542 

.000013 

cum  : 

.088608 

.000033 

a-d  : 

♦088597 

.000033 
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TABLE  A. 12 


SAMPLE  SIZE  12 
TRUE  MU  1.00 
TRUE  SIGMA  2.00 
TRUE  XI  10.00 


MU 

SI  8 

XI 

DIV 

MEAN  SQUARE  ERRORS 

I  NT 

• 

t 

.  44904-4 

. 270869 

.086651 

7 

KOL 

♦ 

• 

.53917? 

.379041 

.082294 

COM 

♦ 

♦ 

.421967 

.226249 

. 119563 

12 

A -D 

• 

• 

.416589 

.240181 

.099595 

30 

ITR 

• 

• 

.459219 

.309192 

.725196 

KOL 

• 

• 

.422344 

.556705 

.565643 

0 

COM 

* 

♦ 

.411279 

.207014 

.054377 

67 

A-D 

♦ 

♦ 

.426530 

.237099 

.057158 

67 

RELATIVE  EFFICIENCIES 

INT 

• 

♦ 

1.000000 

1.000000 

1.000000 

KOL 

♦ 

• 

.83282? 

.714616 

1.052942 

COM 

• 

• 

1.064167 

1.197219 

.724734 

A-D 

• 

♦ 

1.077905 

1.127772 

.870033 

ITR 

♦ 

♦ 

.977841 

.876056 

.119486 

KOL 

• 

♦ 

1.063217 

.*86558 

.153190 

COM 

♦ 

• 

1.091821 

1.308461 

1.593510 

A-D 

• 

♦ 

1.052784 

1.142431 

1.515996 

COM  STATISTICS 


MCVM 

SDCVM 

int  : 

1,233838 

.041781 

KOL  J 

.114867 

.000514 

cvm  : 

.049472 

,000486 

a-d  : 

,051923 

,  000508 

itr  : 

.486826 

.000562 

KOL  : 

.046033 

.000585 

cvm  : 

.051311 

,000472 

a-d  : 

.051267 

.000469 
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TABLE  A. 13 


SAMPLE  SIZE  16 
TRUE  MU  0.00 
TRUE  SIGMA  1.00 
TRUE  XI  10.00 

MU 

SIS 

MEAN  SQUARE  ERRORS 

XI 

INI 

.232383 

.203052 

.029973 

KOL 

♦370743 

.509347 

.043096 

CVM 

. 222203 

.177816 

. 030684 

A-D 

. 174003 

. 126010 

. 02  7965 

IT!'; 

.700175 

.281351 

.667245 

KOL 

♦456917 

.221362 

.569259 

COM 

,250650 

.193066 

.036964 

A~D 

♦249732 

.  194706 

RELATIVE  EFFICIENCIES 

.036456 

INI 

1.000000 

1 .000000 

1.000000 

KOL 

.613565 

.398651 

.695501 

COM 

1.045437 

1.129222 

,976829 

A-D 

1.197831 

1.611390 

1.071822 

ITR 

.754063 

,721702 

.044921 

KOL 

.508507 

.917282 

.052653 

CVM 

.927125 

1.051723 

.810873 

A-D 

INT 

KOL 

CVM 

A-D 

.930532 

1.042865 

CVM  STATISTICS 

MCVM  SDCVM 

,058370  .000001 

.202583  .000010 

.052597  ,000001 

.056141  ,000001 

.822172 

ITR 

•  2 

.048071  .000002 

KOL 

:  l 

.690798  ,000008 

CVM 

.083688  ,000001 

A-D 

•  ♦ 

,083688  ,000001 

D I V 


4 

51 

30 


0 

336 

30? 
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TABLE  A. 14 


:nt 

iO\. 

:vm 

V-D 

[TR 

COL 

:vm 

“i-D 


i  nt  : 

KOL  : 

cvm  : 
a-d  : 

itr  : 

KOL  : 

vm  : 
-d  : 


int  : 
nol  : 
cvm  : 
a-d  : 

itr  : 
kcjl  : 
cvm  : 
a-d  : 


SAMPLE  SIZE  16 
TRUE  MU  1 . 00 
TRUE  SIGMA  1*00 
TRUE.  XI  10.00 


MU 

SIG 

XI 

MEAN  SQUARE  ERRORS 

.232303 
.432193 
.242403 
.  190384 

. 203052 
.656473 
.228070 

.119146 

.221497 

.307171 

.237409 

.202699 

.308562 

,061823 

.256022 

.250442 

.282413 
.054438 
.220713 
.  1.94406 

.307384 
. 188856 
.270062 
.268905 

RELATIVE  EFFICIENCIES 

1 .000000 
.537685 
.958667 
1.220607 

1.000000 

.309307 

.890274 

1.704219 

1.000000 

.721088 

.932978 

1.092739 

.753118 

3.758850 

.907668 

.927893 

.718989 

3.729951 

.919982 

1.044473 

.720507 
1.172838 
.820171 
. 823699 

niv 


3 

3 


0 

204 

302 


CVM  STATISTICS 


MCVM 

.05971V 

.152448 

.076387 

.056961 

.772859 

.640381 

.080935 

.084114 


SDCVM 

,000106 

.000483 

.000794 

.000078 

.000093 

.000413 

.000253 

.000106 


8 


TABLE  A. 15 


SAMPLE!  SIZE  16 
TRUE  MU  1.00 
TRUE  SIGMA  2.00 
TRUE  XI  10.00 


MU 

SIG 

XI 

DIV 

MEAN  SQUARE  ERRORS 

I  NT 

.303700 

.200303 

.030553 

KOL 

.350235 

.259372 

.033354 

5 

CVM 

.291445 

. 165668 

.042253 

11 

A-D 

.287489 

.177657 

.032471 

38 

ITR 

.322491 

.235817 

.787416 

KOL 

.414454 

.51 9659 

.659067 

0 

CVM 

♦293632 

. 162336 

.024263 

77 

A-0 

.301267 

. 186689 

.024930 

61 

RELATIVE  EFFICIENCIES 

INT 

1.000000 

1.000000 

1.000000 

KOL 

.867134 

.772261 

.916034 

CVM 

1.042050 

1.209059 

.723105 

A-D 

1.056387 

1.127467 

.940942 

ITR 

.941733 

.849399 

.038802 

KOL 

.732771 

,385450 

.046359 

CVM 

1.034290 

1.233879 

1.259276 

A-D 

1.008075 

1.072922 

1 .225547 

CVM  STATISTICS 


MCVM 

SDCVM 

INT  : 

.059835 

.000964 

KOL  : 

.121674 

.001103 

cvm  : 

.057521 

.001007 

a-d  : 

.072231 

.001043 

itr  : 

.585003 

.000941 

KOL  : 

.653962 

.001038 

cvm  : 

.073151 

.000925 

a-d  : 

.059473 

.000954 
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Appendix  B 


Computer  Listing  of  Data  Generation  and 
Max imum  Likelihood  Techniques 


PROGRAM  BL'GLND 
C 

C  #*  >fc  **  ****** ***  ***  ¥  **  **  He******  *  4:  ******************************* 

C  * 

C  *  WRITTEN  BY  2LT  JIM  H  KEEFER  AFTT/G0R-83D  FOR  MS  THESIS 

C  *  DECEMBER  1783 


c 

c 

* 

* 

purpose: 

( 1  > 

GENERATE  SAMPLES  OF  3-L.ND  DEVIATES 

* 

* 

c 

* 

(2) 

CALCULATE  THE  MEAN  AND  STD.  DEV.  OF  SAMPLES 

* 

c 

* 

(3) 

CALCULATE  THE  MLE ' S  USING  THE  INTERPOLATION 

* 

c 

* 

ME 

TI-IOU 

* 

c 

* 

(4) 

CALCULATE  THE  MLE ' S  USING  THE  ITERATIVE 

* 

c 

* 

METHOD 

* 

c 

* 

* 

c 

* 

variables: 

DSEED 

- 

SEED  FOR  RANDOM  NUMBER  GENERATOR 

* 

c 

* 

TMU 

- 

TRUE  MEAN  OF  PARENT  NORMAL 

* 

c 

* 

TSIG 

- 

TRUE  STD.  DEV.  OF  PARENT  NORMAL 

* 

c 

* 

TXI 

- 

TRUE  LOCATION  VALUE 

* 

c 

* 

BMU 

- 

INTERPOLAT IVE  MLE  OF  TMU 

* 

c 

* 

BSIG 

- 

INTERPOLATIVE  MLE  OF  TSIG 

* 

c 

* 

BXI 

- 

INTERPOLAT IVE  MLE  OF  TXI 

* 

c 

* 

HMU 

- 

ITERATIVE  MLE  OF  TMU 

* 

c 

* 

HSIG 

- 

ITERATIVE  MLE  OF  TSIG 

* 

c 

* 

HXI 

- 

ITERATIVE  MLE  OF  TXI 

* 

c 

* 

N 

- 

DESIRED  SAMPLE  SIZE 

* 

c 

* 

NC 

- 

#  OF  DEVIATES  CENSORED  FROM  BELOW 

* 

c 

* 

NREPS 

- 

NUMBER  OF  REPLICATIONS 

* 

c 

* 

MLE 

- 

SUBROUTINE  TO  COMPUTE  ITERATIVE  MLE '3 

* 

c 

* 

GGLNG 

- 

IMSL  ROUTINE  WHICH  GENERATES  2-LND  DEVIATE* 

c 

* 

VSRTA 

- 

IMSL  ROUTINE  WHICH  ORDERS  DATA 

* 

c 

* 

SUM 

- 

DUMMY  VARIABLE  USED  TO  COMPUTE  SAMPLE 

* 

c 

* 

MEANS  AND  STD.  DEVS. 

* 

c 

* 

SUMR 

- 

DUMMY  VA  IABLE  USED  FOR  SUMS 

* 

c 

* 

SUMR1 

- 

DUMMY  VARIABLE  USED  FOR  SUMS 

* 

c 

* 

X 

- 

VECTOR  OF  3-LND  DEVIATES 

* 

c 

* 

MEAN 

- 

SAMPLE  ARITHMAT IC  MEAN 

* 

c 

* 

SD 

- 

SAMPLE  STANDARD  DEVIATION 

* 

c 

* 

Y1 

- 

MEDIAN  RANK  OF  FIRST  ORDER  STATISTIC 

* 

c 

* 

Y2 

- 

MEDIAN  RANK  OF  SECOND  ORDER  STATISTIC 

* 

c 

* 

SLOPE 

- 

SLOPE  OF  INTERPOLATION  LINE 

* 

c 

* 

* 

c 

* 

1/0  files: 

INPUT 

- 

UNFORMATTED  INPUT  OF  TRUE  PARAMETERS 

* 

c 

* 

TAPES 

- 

OUTPUT  OF  TRUE  PARAMETERS,  SAMPLES, 

* 

c 

* 

MEANS,  SD,  AND  ALL  MLE  ESTIMATES 

* 

c 

* 

* 

c 

* 

important: 

IMSL 

LIBRARY  MUST  BE  ATTACHED  BEFORE  THE  PROGRAM 

* 

c 

* 

IS  RUN. 

REVIEW  IMSL  MANUAL  ON  GGNL.G  AND  VSRTA 

* 

c 

* 

* 

C  ****************************************************************** 

C 
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****** 


nonn  ~  r.  n  non 


C 


100 

C 

103 

C 


101 

10 


20 


30 


COMMON  N ,  NC ,  X  <  50 )  ,  I  IHtJ , HS 1 0 , HX I , J , S IG , NN 

EXTERNAL  CGNLG , VStRA , MLE 

DOUBLE  PRECISION  DSEED 

REAL  TMU , TS IG , TXI , BMU , BSIG , BX I 

REAL  MEAN  ,  SUM  ,  SD  r  SLOPE  ,  Y  .1. ,  Y2 

INTEGER  N 

PSEED=185921752.ri0 

***  READ  PARAMETERS  AND  WRITE  THEM  TO  FILE  *** 

PRINT* ,  '  ENI  ER  ThU  ,  I  SI  G  ,  TXI ,  N  ,  NITERS ' 

READ* , 1  MU ,  TS1G ,  TXI , N , NREPS 
WRITE (5,100)  NREPS , N , TMU , TSIG , TXI 
FORMAT ( 1 4/13/3 (F 15. 6/)  ) 

******  BEGIN  DO- LOOP  FOR  GENERATION  OF  SAMPLES  ****** 

DO  999  J=l, NREPS 
FORMAT ( 14) 

***  GENERATE  AND  SORT  SAMPLES  FROM  2-LN  DISTRIBUTION  * 
CALL  GGNLG  (  DSEED ,  N  ,  TMtJ ,  TS I G ,  X  ) 

WRITE ( 5 , 103 )  J 
CALL  VSRT A  <  X , N ) 

***  ADD  LOCATION  PARAMETER  TO  2-LN  DEVIATES  *** 

***  WRITE  THE  3-1.  N  DEVIATES  TO  FILE  #** 

***  CALCULATE  SAMPLE  MEAN  *** 

SUM=0.0 
DO  10  1=1, N 
X(I)=X(I )+TXI 
WRITE (5,101)  X(I) 

FORMAT (F 13. 6) 

SUM=SUM  +  X(I) 

CONTINUE 
ME  AN=SIJM/N 

***  CALCULATE  SAMPLE  STANDARD  DEVIATIONS  *** 

SUM=0 . 0 
DO  20  I ■- 1 , N 

SUM  =  SUM+ ( X ( I ) -MEAN ) * ( X ( I ) -MEAN  > 

CONTINUE 

SD= ( BUM/N  >  **0  ♦  5 

***  CALCULATE  MEDIAN  RANKS  *** 

***  INTERPOLATE  TO  ESTIMATE  BXI  *** 

Yl=( 1 . 0-0 • 3 ) / ( N+0 , 4 ) 

Y2--  <  2 . 0-0 . 3 )  /  ( NTO  .  4 ) 

SLOPE  =  C  Y2-Y 1 ) / ( X ( 2) -X  ( 1 )  ) 

BXI  =  X  ( 1 )  -Yl/SLOF’E 

***  CALCULATE  HI..E  OF  JMU  AND  TSJG  USING  THE  INTERPOL  AT! 
**#  VALUE  FOR  THE  LOCATION  PARAMETER,  XI  *** 

***  USE  FIRST  ORDER  STATISTIC  IF  THE  INTERPOLATED  VAI  IJf 
***  OF  THE  LOCATION  PARAMETER  IS  CLOSE  TO  X(l)  *** 

IF  (BXI  .GT.  X ( 1 )  )  BXI=X(1) 

SIJM=0 . 0 
DO  30  I = 1 , N 

SUM=SUMTLOG(X( I) -BXI) 

CONTINUE 
PMU=SUM/N 
SUM=0 . 0 
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DO  40  1  =  1,  N 

GUM  =  GUM  +  ( LOG ( X <  I )  -BXI )  -  BMU)##2 
40  CONTINUE 

BS1  (3=BQF\T  (SUM/N) 

C  #*#  CALCULATE  THE  MLE ' S  USING  THE  CENSORED  MAXIMUM  LIKELIHOOD  * 

C  ***Em!ATIONS  AND  THE  ITFFsATIVE  PROCEDURE 

CALL  MLE(SD) 

C  *##WPITE  THE  Ml  F. 'S,  SAMPLE  MEAN  AND  STANDARD  DEVIATION### 

C  **#  TO  FILE  FOR  EACH  SAMPLE  ### 

WRITE  <  5 , 1 02 )  PHU , BS1 0 , BXI , HMU , HSIG , HX 1 , MEAN , SD 
102  FORMAT ( F15 ♦ 3/F15 » 3/F15 ♦ 3/F15  »3/F 15, 3/F15 • 3/F IS  #  6/F15 . 6 ) 

C  ****###  END  DO-LOOP  #*##### 

999  CONTINUE 
STOP 
END 
C 
C 

SUBROUTINE  Ml.E(SD) 

C  ###########*##########*##*###**#######*######*######*####*#**##### 


C 

# 

# 

C 

# 

purpose: 

(1)  CALCULATE  THE  ITERATIVE  MLE  '  3  FOR  TM'J  AND 

# 

c 

# 

T 

SIG 

I  WHILE  TX I =X ( 1  ) 

* 

c 

# 

# 

c 

* 

variables: 

SIJMN 

- 

DUMMY  USED  FOR  SUMS 

# 

£ 

# 

SUMO 

- 

DUMMY  USED  FOR  SUMS 

# 

c 

# 

PARI 

- 

INITIAL  S  FINAL  ESTIMATE  OF  TSIG  VIA 

# 

c 

# 

EQUATING  SAMPLE  AND  POPULATION  SKEWNESS 

* 

c 

* 

XBAR 

- 

SAMPLE  MEDIAN 

# 

c 

# 

SIG 

- 

FINAL  ESTIMATE  OF  TSIG  FROM  SAMPLE 

# 

c 

# 

SKEWNESS 

# 

c 

# 

Cl 

- 

ARRAY  OF  CONSTANTS  FOR  USE  BY  ZSCNT 

# 

c 

# 

FCNO 

- 

FUNCTION  USED  TO  CALCULATE  EXPCSIG*#2J 

# 

c 

# 

FCN1 

- 

FUNCTION  USED  TO  CALCULATE  MLE  OF  1  MU 

# 

c 

# 

FCN2 

- 

FUNCTION  USED  TO  CALCULATE  MLE  OF  TSIG 

# 

c 

# 

# 

c 

# 

note: 

UNDEFINED  VARIABLES  IN  THIS  SUBROUTINE  ARE 

* 

c 

# 

DEFINED 

IN  THE  MAIN  PROGRAM 

* 

c 

# 

# 

C  in######*#*##**##*********#*####***##**###*###########***##*###*#*# 

COMMON  N , NC , X < 50  I , HMU , HSIG , HXI , J , SIG,NN 
EXTERNAL  F CNO, FCN1 , FCN2 

REAL  WK ( 04 ) , FNORM , PAR ( 1 ) ,C ( 1 ) , SD , PARI ( 1 ) ,C1 ( 1 ) 

INTEGER  J 

INTEGER  N , NC , NS I G , NPAR , TTMAX , TER , MAXFN  , K 
C  ###  CHECK  IF  ANY  OF  THE  SAMPLE  POINTS  ARE  CLOSE  TO  THE  ### 

C  ###  FIRST  ORDER  STATISTIC  -  CENSOR  THEM  IF  THEY  ARE  ### 

NC  =  .l 

DO  10  I =2 , N 

IF  <  ( X ( I ) -X <  1 )  )  » LE .  0,001)  NONCT1 
10  CONTINUE 

C  #**  SET  ZGCNT  PARAMETERS  FOR  SKEWNESS  AND  MLE  CALCULATIONS  ### 

NPAR=1 
NSIG=3 
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n  n 


NF'ARI  =1 

ITMAX=1000 

NN=0 

C ( 1 >  =0 . 0 

C  ***  CALCULATE  SAMPLE  SKEWNESS  AND  SIGMA  FROM  THE  SKEWNESS  *** 

C  ***  EQUATION  -  RESULTS  ARE  USED  AS  INITIAL  ESTIMATES  *** 

XBAR=X  (INI  <  (N-NO/2)  ) 

SUMN=0 • 0 

suMii-o.o 
no  8  I  =NC  , N 

SUMN=SUMN+<X< I >-XBAR>**3 
SUM]i~SUMD+  (  X  (  I  > -XBAR  )  **2 
8  CONTINUE 

BIJMN=SUMN/ ( N-NC ) 

SUMD= ( SUMD/ ( N-NC ) )**1 .5 
C 1 ( 1 ) -SUMN/SUMB 
PARK  1  >=LOG<GD> 

IF  <  PARI  ( 1 )  .  L  T  .  O.OO.t)  PAR  1(1)  =  1*0 

CALL  2SCNT ( FCNO , NS I G , NPAR 1 , ITMAX , Cl , PARI , FNORM ,  UK ,  I ER ) 

SIG=SQRT  ( LGG ( ABS  <  PARI  (  1  )  )  )  ) 

*#*.  SET  INTI AL  VALUES  FOR  THE  ITERATIVE  MLE  PROCEDURE  AND  *** 
***  CALCULATE  THE  MLE  OF  TMU  AND  TSIG  *** 

HMU=LOG ( XI AR ) 

H3IG-SIG 

IF  (SIG  .LT.  1.001)  HSIG=PAR1 ( 1 ) 

PAR  <  1  )~HM'J 

CALL  ZSCN T <FCN1,NSI0, NPAR, ITMAX, C, PAR, FNORM, UK, I ER) 

HMU=PAR ( 1 ) 

PAR ( 1 ) =  HS I G 

CALL  ZSCN i ( FCN2 , NSIG , NPAR , ITMAX , C , PAR , FNORM , UK , I ER ) 

HSIG=PAR( 1 > 

RETURN 
END 
C 

SUBROUTINE  FCN 1 < PAR , F , NP , C ) 

C  ***  CALCULATE  THE  CENSORED  i-iLE  OF  TMU  *** 

DIMENSION  PAR(NP) ,F(NP) ,C( 1 ) 

COMMON  N , NC ,  X  < SO ) , HMU , HSI G , HXI , J , SIG , NN 
SUM=0 . 0 
HX I =X ( NC ) 

T= ( LOG ( X  <  NC  +  1 > -HXI >-PAR< 1 ) )/HSIG 
FU= . 3989423*FXP ( - ( Y **2 ) /2 ) 

CALL  MDNOf  :  (  Y  ,  Z  ) 

R=NC*(FU/Z) 

DO  10  1=1  PNC,  II 

f>UM=SUM+  ( LUG  (  X  ( I )  -HXI )  -PAR  ( 1 )  >/HSTG 
10  CONTINUE 
F< 1 )=SUM-R 
RETURN 
END 
C 
C 

SUBROUTINE  FCN2 ( PAR , F , NP , C ) 
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,  AD- A 1 38  007  ROBUST  MINIMUM  DISTANCE  ESTIMATION  OF  THE  THREE 

PARAMETER  LOGNORMAL  DISTRIBUTION(U)  AIR  FORCE  INST  OF 
TECH  WRIGHT-PATTERSON  AFB  OH  SCHOOL  OF  ENG I 
UNCLASSIFIED  d  H  KEFFER  DEC  83  AFIT/GOR/MA/83D-3  F/G  12/1 


MICROCOPY  RESOLUTION  7 ESI  CHART 

NATIONAL  HUKI.au  0»  SI ANOAHDS  l‘K>.T  A 


on  no 


#**  CALCULATE  THE  CENSORED  MLE  OF  TSIG  *** 

DIMENSI  ON  PAR  ( NP  )  ,  F  ( NF‘ ) ,  C  ( 1  > 

COMMON  N,NC, X(50) ,HMU,HSIG,HXI , J,SIG,NN 

*#*  Ir  THE  PAR ( 1 )  ESTIMATE  IS  INFEASIBLE,  CHANGE  THE  INITIAL  **# 
***  ESTIMATE  TO  THE  SIGMA  OBTAINED  FROM  THE  SKEWNESS  EON.  **# 
IF  < PAR <  1 )  * L.E »  SJG  .AND.  NN  .EQ.  0)  THEN 
PAR  < 1 >  =SIG 
NN=1 
END  IF 
SUM=0 . 0 
HXI=X(NC) 

Y=  <LOG<  X  <  NC+1 ) -HXI > -HMU ) /PAR( 1 > 

FU= . 398?423*EXP  < - ( Y#*2 ) /2 ) 

CALL  MDNOR(Y,Z) 

R=NC*(Y*FU)/Z 
DO  10  I=1+NC,N 

SUM=SUM+  <  < LOG ( X < I ) -HXI > -HMU ) /PAR ( 1 ) ) **2 
10  CONTINUE 

F  < 1 ) =SUM- ( N-NC ) -R 

RETURN 

END 


SUBROUTINE  FCNO < PARI ,F , NP ,C1 > 

C  #**  CALCULATE  SIGMA  FROM  THE  SKEWNESS  EON  -  PAR1(1)=W  *** 
REAL  F<NP) yCl(NP) ,PAR1 (NP) 

F< 1 )=< <PAR1<1)+2^**2)»<PAR1(1)-1>-  (C1<1>>**2 

RETURN 

END 

*EOR 
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jendix  C 


Computer  Listing  of  Minimum 
Distance  Techniques 


PROGRAM  MLEMD 

*************************  ****** **#*##**#*#****##****#**#**###***** 


WRITTEN 


KEEPER  AFIT/G0R-83D  FOR  MS  THESIS 
DECEMBER  1983 


PURPOSE?  MINIMUM  DISTANCE  ESTIMATION  OF  THE  LOCATION 

PARAMETER  FOR  THE  3-LN  DISTRIBUTION  USING  THE 
INTERPOLATE  AND  ITERATIVE  MLE'S.  THE  MINIMUM 
DISTANCE  ESTIMATORS  ARE  THE  KOLMOGROV  DISTANCE, 
CRAMER- VON  M15ES  DISTANCE,  AND  THE  ANDERSON- 
DARLING  STATISTICS 


variables: 


NREPS 

N 

TMU 

TSIG 

TXI 

BMU 

BSIG 

BSIG 

BXI 

HMU 

HSIG 

HXI 

K 

N0B 


NC 

NN 

BXIC 

X 

MEAN 

SD 

PAR 

KXI 

CVMXI 

A2XI 

HONOR 

2XMIN 

NPAR 

MAXFN 

NSIG 

I  OPT 

DKSXI 

DCVXI 

DA2XI 


NUMBER  OF  REPLICATIONS 
SAMPLE  SIZE  (INPUT) 

TRUE  MEAN  OF  PARENT  NORMAL  (INPUT) 

TRUE  STANDARD  DEVIATION  OF  PARENT  NORMAL 
TRUE  VALUE  OF  THE  LOCATION  PARAMETER 
INTERPOLATIVE  MLE  OF  TMU 


-  INTERPOLATIVE  MLE  OF  TMU 

-  INTERPOLATIVE  MLE  OF  TSIG 

-  INTERPOLATIVE  MLE  OF  TSIG 

-  INTERPOLATIVE  MLE  OF  TXI 

-  ITERATIVE  MLE  OF  TMU 

-  ITERATIVE  MLE  OF  TSIG 

-  ITERATIVE  MLE  OF  TXI 

-  SAMPLE  INDEX 

-  DIVERGENCE  COUNTER  OF  INTE 
WHERE  0  IS  INITIAL  OF  MD  f 

-  DIVERGENCE  COUNTER  OF  ITEF 
WHERE  0  IS  INITIAL  OF  MD  F 

-  #  OF  DEVIATES  CENSORED  FRC 

-  CHECK  FOR  DIVERGENCE  OF  KC 


INTERPOLATIVE  MLE'S  * 
MD  METHOD  * 

ITEFlATI  VE  MLE'S  * 
MD  METHOD  * 

FROM  BELOW  * 

F  KOLMOGROV  ESTIMATE* 


NEW  INITIAL  ESTIMATE  -  USED  WHEN  FIRST 
INITIAL  ESTIMATE  DIVERGES 
ARRAY  OF  3-LN  DEVIATES 
SAMPLE  MEAN 

SAMPLE  STANDARD  DEVIATION 
ARRAY  CONTA FNING  MLE  OF  XI  FOR  MD 
KOLMOGROV  MD  ESTIMATE  OF  XI 
CRAMER-VON  MISES  MD  ESTIMATE  OF  XI 
ANDERSON -DARLING  MD  ESTIMATE  OF  XI 
IMSL  ROUTINE  FOR  NORMAL  CDF  VALUES 
IMSL  ROUTINE  TO  MINIMIZE  THE  DISTANCE 
NUMBER  OF  VARIABLES  INPUTTED  BY  ZXMIN 
MAXIMUM  *  OF  ITERATIONS  BY  ZXMIN 
*  OF  SIGNIFICANT  DIGITS  ZXMIN  SLOVES  FOR 
ZXMIN  OPTION  (SEE  IMSL  MANUAL) 

SUBROUTINE  TO  FIND  DISTANCE  VIA  KOLMOGROV 
SUBROUTINE  TO  FIND  DISTANCE  VIA  CVM 
SUBROUTINE  TO  FIND  DISTANCE  VIA  A-D 


oooooonoonoononr 


*  F  DISTANCE  VALUE  IN  SUBROUTINES  * 

*  H,G,U  -  ARRAYS  USED  BY  ZXMIN  (SEE  IMSL  MANUAL)  * 

*  IER  -  IMSL  ERROR  MESSAGE  * 

*  * 

*  I/O  files:  TAPES  -  INPUT, CONTAINS  true  parameter  values,  * 

*  SAMPLE  DEVIATES, SAMPLE  MEAN  AND  STD.  DEV.  * 

*  AND  MLE  AND  MD  ESTIMATES  * 

*  * 

*  TAPE6  -  OUTPUT,  CONTAINS  TRUE  PARAMETER  VALUES  AND* 

*  ESTIMATED  PARAMETER  VALUES  FOR  EACH  * 

*  MINIMUM  DISTANCE  METHOD  * 

*  * 

*  IMPORTANT:  IMSL  LIBRARY  MUST  BE  ATTACHED  BEFORE  RUNNING  THE  * 

*  PROGRAM.  REVIEW  IMSL  MANUAL  ON  ZXMIN.  * 

*  * 


****************************************************************** 

COMMON  X(50) ,NC,N,BMU,BSIG,BXI,BXIC,NN 
EXTERNAL  ZXMIN , DCVXI , DA2X I , DKSX I , MLE , MDNOR 
DIMENSION  PAR(l) ,H(1>,G(1),W<3> 

INTEGER  N»K 

REAL  MEAN,KXI ,CVMXI , A2XI , MU,SIG,XI »SD 
C  ***  INITIALIZE  DIVERGENCE  COUNTERS  *** 

NKB=0 

NCB=0 

NAB=0 

NKH=0 

NCH=0 

NAH=0 

C  ***  INPUT  TRUE  PARAMETERS  *#* 

READ<5, 100)  NREPS,N,TMU,TSIG,TXI 

100  FORMAT  < 1 4/13/3 ( F 15 .6/ ) ) 

WRITE <7, 106 )  NREPS 

106  FORMAT (14) 

WRITE< 7 , 101 )  N 

101  FORMAT (13) 

URITE< 7 ,102)  TMU,TSIG,TXI 

102  FORMAT (F15.6/F15.6/F15.6/) 

C  *******  BEGIN  DO-LOOP  FOR  NREPS  SAMPLES  ******* 

DO  99V  J«l, NREPS 

C  ***  INPUT  SAMPLE  INDEX  *** 

RFAD<5» 106)  K 

C  ***  INPUT  SAMPLE  DEVIATES  *** 

DO  10  I“1 ,N 

RE AD (5, 103)  X(I) 

103  FORMAT <F lb.6) 

10  CONTINUE 

DO  40  1*1, N 
40  CONTINUE 

C  ***  INPUT  ESTIMATE  OF  XI,  SAMPLE  MEAN  AND  STD.  DEV.*** 

READ<5,  104)  BMl),BSIG,BXI  ,HMU,HSIG,HXI ,  ME  AN,  3D 

104  F0RMAT<F15.6/F15.6/F15.6/F15.6/F15.6/F15.6/F15.A/F15.6) 
BXIC-HXI-1.0 

C  ***  BEGIN  MD  ESTIMATION  USING  INTERPOLATIVE  MLE '8  *** 
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c 


HO  888  L=l,2 

#**  BEGIN  MD  ESTIMATION  USING  ITERATIVE  MLE'S  *** 

IF  (  L  .EQ.  2  )  THEN 
BMU=HMU 
BSIG-HSIG 
BXIC=BXI 
BXI==HXI-1 .0 
END  IF 

WRITE (7,105)  BMU , BS1G , BXI 
105  FORMAT (3(F15.6/‘ ) 

C  #**  SET  ZXMIN  PARAMETERS  *** 

NF'AR=1 

NSIG=3 

hAXFN=3000 

I0PT=0 

C  ***  MINIMIZE  DISTANCE  VIA  KOL.MOGROV  DISTANCE  *** 

PAR  < 1 ) =BXI 
NP=1 
NC-0 
NN=0 

CALL  DKSXI (NP, PAR) 

KXI=PAR ( 1 ) 

IF  (KXI  .LT.  X( 1 ) )  THEN 
CALL  MLE(KXI) 

ELSE 

IF  <L  .EQ.  1)  NKB=NKB+1 
IF  <L  .EQ.  2)  NKH=NKH+1 
WRITE<7, 107)  HMU,HSIG,HXI 

ENDIF 

C  ***  MINIMIZE  DISTANCE  VIA  CRAMER  VON-MISES  *** 

PAR< 1 )=BXI 

NC*0 

NN=0 

CALL  ZXMIN ( DCVX I , NPAR , NS IG , MAXFN , I OPT , PAR ,H,G,F,U,IER> 
CVMXI=PAR<1) 

IF  (CVMXI  .LT.  X<1)>  THEN 
CALL  MLE( CVMXI) 

ELSE 

IF  <L  .EQ.  2)  NCH=NCH+1 
IF  <L  .EQ.  1)  NCB=NCB+1 
WRITE  <  7, 107)  HMU,HSIG,HXI 
107  FORMAT (3CF15.6/)) 

ENDIF 

C  ***  MINIMIZE  DISTANCE  VIA  ANDERSON-DARLING  STATISTIC  *** 

PAR(1)=BXI 
NC=0 
NN*0 

CALL  ZXMI N <  DA2X I , NPAR , NSI G , MAXFN , I OPT , PAR , H , G , F , W , IER ) 
A2XI*PAR< 1 > 

IF  < A2XI  .LT.  X< 1 ) )  THEN 
CALL  MLE < A2XI > 

ELSE 

IF  <L  .EQ.  1)  NAB=NABF1 
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IF  (L  .  EQ.  2)  NAH=NAH+1 
WRITEC7 , 107)  HMU,HSIG,HXI 

END  IF 
NC=0 

888  CONTINUE 

C  #####*#END  D0-L00P  FOR  NREPS  ******* 
999  CONTINUE 

WRI TE  <  7 , 1 08  >  NKB , NCB , NAB , NKH , NCH , NAH 
108  F0RMAT(I4,I4,I4,I4,I4, 14) 

STOP 

END 


SUBROUTINE  MLE (XI) 

****************************************************************** 
*  PURPOSE:  CALCULATE  THE  MLE  OF  MU  AND  SI 6  GIVEN  THE  VALUE  * 


*  OF  THE  LOCATION  PARAMETER  (XI)  * 

*  * 

*  variables:  n  -  sample  size  * 

*  X  ARRAY  OF  3-LN  DEVIATES  * 

*  MU  -  MLE  ESTIMATE  OF  PARENT  NORMAL  MEAN  * 

*  SIG  -  MLE  ESTIMATE  OF  PARENT  NORMAL  ST.  DEV.  * 

*  XI  -  ESTIMATE  OF  LOCATION  PARAMETER  (INPUT)  * 

*  SUM  -  DUMMY  VARIABLE  USED  FOR  SUMS  * 

*  * 


****************************************************************** 

COMMON  X  (50)  , NC , N , BMU , BSIG , BXI ,BXIC  »NN 
REAL  XI » MU, SIG, SUM, Y  ,  Z 
INTEGER  N 

*#*  INITIALIZE  PARAMETERS  *** 

SUM=0.0 

MU=0.0 

SIG=0.0 

***  CALCULATE  THE  MLE  OF  MU  *** 

DO  10  1=1 ,N 

SUM=SUM+LOG ( X ( I ) -X I ) 

10  CONTINUE 
MU=SUM/N 

***  CALCULATE  THE  MLE  OF  SIG  *** 

SUh=0.0 
DO  20  1=1, N 

SUM  «  (  LOG(X(I)-XI>  -  MU  )**2  +  SUM 
20  CONTINUE 

SIG=SQRT (SUM/N) 

WRITE<7, 102)  MU, SIG, XI 
102  FORMAT  <3(F15.6/)) 

RETURN 

END 


SUBROUTINE  DKSXI (NP,PAR> 
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*####*#*#***#***##**#****#***#*#**x<****#**#******#**#**#*#*#****#* 


*  y 

*  purpose:  find  distance  BETWEEN  ESTIMATED  CDF  AND  1/N  * 

*  STEP  FUNCTION  FOR  XI  VIA  KOLMOGROV  DISTANCE  * 

*  # 

*  variables:  NP  -  NUMBER  OF  PARAMETERS  (ALWAYS  1)  * 

*  PAR  -  ARRAY  OF  PARAMETER  VALUES  * 

*  PLUS  -  DUMMY  VARIABLE  FOR  D+  * 

)k  MINUS  -  DUMMY  VARIABLE  FOR  D-  * 

)k  DMAX  -  VALUE  OF  D+  STATISTIC  * 

3k  DMIN  -  VALUE  OF  D-  STATISTIC  * 

*  COUNT  -  LOCATION  OF  POINT  WHERE  MINCMAX  DISTANCE]  Jk 

*  M  -  #  OF  DEVIATES  REMAINING  AFTER  NC  CENSORED  Jk 

*  FROM  BELOW  * 

Jk  Z  PERCENTILE  POINT  FROM  STD  NORMAL  CDF  COR-  Jk 

>k  RESPONDING  TO  THE  3-LN  CDF  PERCENTILE  F'T »  * 

>k  3k 

)k  NOTE:  UNDEFINED  VARIABLES  ARE  DEFINED  IN  THE  MAIN  PGM  * 

3k  * 


3**************************#*****#**********#)*^#******************* 
COMMON  X(50) ,NC,N,BMU,BSIG,BXI,BXIC,NN 
INTEGER  NP » COUNT  , M 

REAL  PAR(NF')  ,  MAXI  ,  PLUS, MINUS, DMAX,  Y,Z 

***  USE  FIRST  ORDER  STATISTIC  IF  ESTIMATED  VALUE  OF  XI  IS  *** 

***  CLOSE  TO  THE  FIRST  ORDER  STATISTIC  *Jk* 

IF  < (X(l)-PAR(l) )  .LT.  O.OOl)  THEN 
NC=1 

DO  3  1=2, N 

IF< (X(I)-X(l))  .LT.  0.001)  NC=NC+1 
3  CONTINUE 

PAR< 1 )=X(NC) 

ENDIF 

MAXI=99999 • 

M=N-NC 

C  ***  CALCULATE  KOLMOGROV  STATISTICS  *** 

2  DO  5  J=1 , 400 

XII=PAR(l)  +  (-2.0+.0l!kJ) 

IF  ( X < 1+NC )  .LT.  XII  +  0.001)  GOTO  5 
DO  10  1=1, M 

Y= ( LOG (X(NC+I)-XII) -DMU ) /BSIG 
CALL  MDNOR ( Y , Z ) 

PLUS=ABS(  I/M  -  Z) 

MINUS=ABS(  Z-< 1-1 )/M> 

DMAX=MAX( PLUS, MINUS) 

IF  (DMAX  .LT.  MAXI)  THEN 
MAXI=OMAX 
COUNT* J 
ENDIF 
10  CONTINUE 
5  CONTINUE 

C  *$$  USE  NEW  INITIAL  ESTIMATE  FOR  XI  IF  AT  BOUNDARY  *Jk>k 
IF  (COUNT  .EG.  400  .AND.  NN  .EG.  0)  THEN 
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PAR(1)=BXJC 
NN=  1 
GO  TO  2 
ENDIF 

C  ***  USE  NEW  INITIAL  ESTIMATE  FOR  XI  IF  AT  BOUNDARY  #** 
IF  (COUNT  .E«.  1  .AND.  NN  .EQ.  0>  THEN 
PAR< 1 )=BXIC 
NN=1 
GO  TO  2 
ENDIF 

PAR  (  1 )  =BXT  +  ( -2 . 0+ .  C- 1 *COUNT  > 

RETURN 

END 


SUBROUTINE  DCVXI (NP, PAR,F> 


#***#*#*#*####**#***#####*######*###**#**********#**************** 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


purpose: 


variables: 


note: 


FIND  DISTANCE  BETWEEN  ESTIMATED  CDF  AND  1/N 
EDF  FOR  THE  LOCATION  PARAMETER,  XI,  VIA  CRAMER 
VQN-'MISES  DISTANCE 

NP  -  NUMBER  OF  PARAMETERS  (ALWAYS  1) 

PAR  -  ARRAY  OF  PARAMETER  VALUES 
SUM  -  DUMMY  VARIABLE  USED  FOR  SUMS 
F  -  DISTANCE  VALUE  AT  THIS  XI 
Z  -  PERCENTILE  PT  FROM  STD  NORMAL  CORRESPOND¬ 
ING  TO  THE  3-LN  CDF  PERCENTILE  PT. 

UNDEFINED  VARIABLES  IN  THIS  SUBROUTINE  ARE 
DEFINED  IN  THE  MAIN  PROGRAM 


* 

* 

* 

* 

* 

* 

* 

* 

It 

* 

* 

* 

* 

* 

* 


fc*##**#**###*##*#*####*########*#*#*#*#**#***##########*####*##*** 


COMMON  X(50) ,NC,N,BMU,BSIG,BXI ,BXIC,NN 
INTEGER  NP 

REAL  PAR ( NP ) , F , SUM , Y , Z 

#**  USE  FIRST  ORDER  STATISTIC  IF  ESTIMATE  OF  XI  LIES  *** 

#**  CLOSE  TO  THE  FIRST  ORDER  STATISTIC  *** 

IF  ( (X(l)-PAR(l) >  .LT.  0.001  .AND.  NN  .*E0.  0)  THEN 
NN=1 

PAR(  1  )=>BXIC 
ENDIF 

C  ***  IF  MD  ESTIMATE  DIVERGED  FOR  XI  USE  A  NEW  INITIAL  ESTIMATE  *** 
IF  < (X(l)-PAR(l) >  .LT.  0.001  .AND.  NN  .EO.  1)  THEN 
NC«1 

DO  3  1=2, N 

IF( (X(I)-X(l) )  .LT. 0,001)  NC=NCI 1 
3  CONTINUE 

PAR< 1 )=X(NC) 

ENDIF 

IF  (NC  .GT.  0)  PAR(1)=X(NC) 
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w  ***  CALCULATE  CRAMER-VQN  MISES  DISTANCE  *** 
SUM=0.0 
M=N-NC 
DO  10  1=1, M 

Y= ( LOG  <  X  < I FNC ) -PAR ( 1 ) ) ~BMU ) /BSI 0 
CALL  MDN0R<  Y,Z) 

SUM=  (Z  -  (2*1-1 .(>)/<2.0*M>  >**2  +1/12#M  +  SUM 
10  CONTINUE 

C  ***  SET  F  EQUAL  TO  CRAMER  VON  MISES  DISTANCE  *** 
F=5UM 
RETURN 
END 


SUBROUTINE  DA2XI (NP,PAR,F) 

******************:;c#****v'##*#*#*******##***!K****#***#****#*#*****# 
*  * 

*  PURPOSE:  FIND  DISTANCE  BETWEEN  ESTIMATED  CDF  AND  1/N  * 

*  EDF  FOR  LOCATION  PARAMETER,  XI,  VIA  ANDERSON-  * 

*  DARLING  DISTANCE  * 


variables: 


NUMBER  OF  PARAMETERS  (ALWAYS  1)  * 
ARRAY  OF  PARAMETER  VALUES  FOR  XI  *■  * 
DUMMY  VARIABLE  USED  FOR  SUMS  * 
DISTANCE  AT  THIS  XI  * 
VALUE  FOR  A-D  STATISTIC  * 
VALUE  FOR  A-D  STATISTIC  * 
PERCENTILE  PT  FROM  STD  NORMAL  CORRESPONDING  * 
TO  Y1 

PERCENTILE  PT  FROM  STD  NORMAL  CORRESPONDING  * 
TO  Y2  * 


note: 


VARIABLES  NOT  DEFINED  IN  THIS  SUBROUTINE  ARE 
DEFINED  IN  THE  MAIN  PROGRAM 


^  dl  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  cV  ^  ^  iL  vl«  ^  If  ^  »li  «1«  ab  ^  ^  ^  ^  ^  ib  ^  ^  Jj  ^  db  ^  ^  ^  ^  ^  ^  ^  ^  ^  d/  ^  ^  ^  ^  ^  ^  ll/  d#  df  dl  d#  df  df 

^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ®  ®  ®  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  *  T'  ^  ^  n*  *  *P  ^p  *  ^  *  V  ® 

COMMON  X<50),NC,N,BMU,BSIG,BXI ,BXIC,NN 
INTEGER  NP 

REAL  PAR ( NP  > , F , SUM , Y 1 , Y2 , Z1 »  Z2 

***  USE  FIRST  ORDER  STATISTICS  IF  ESTIMATE  OF  XI  LIES  *** 

**#  CLOSE  TO  THE  FIRST  ORDER  STATISTIC  **# 

IF  <  <X< 1 >-PAR( 1 ) )  .LT.  0.001  .AND.  NN  ,EQ.  0)  THEN 
NN=1 

PAR(1)=BXIC 
END  IF 

***  IF  FIRST  INITIAL  ESTIMATE  DIVERGES  USE  A  NEW  INITIAL  ESTIMATE 
IF( <X(1) -PAR ( 1 ) )  .LT.  0.001  .AND.  NN  .EQ.  1)  THEN 
NC=1 

DO  3  1=2, N 

IF  <(X<I)-X<1>>  .LT.  0.001)  NONC+1 
CONTINUE 


y 


PARC  1 >=X ( NC ) 

END  IF 

IF  (NC  * GT .  0)  PARU)=X(NC) 

C  ***  CALCULATE  ANDERSON-DARLING  STATISTICS  *** 
SUM=0,0 
M=N-NC 
DO  10  1=1, M 

Yl-C LOG < X ( I +NC ) -PAR < 1 >  > -BMU > /BSIG 
CALL  MDN0RCY1,Z1) 

Y2= (LOGCXCN-H-I) -PARC  1 ) )-BMU> /BSIG 
CALL  MDN0RCY2,Z2) 

SUM* (2*1-1  )*(L0GCZ1  )+LOG(  1-Z2)  )  +  SUM 
10  CONTINUE 

SUM=(-1#SUM)/M-M 

C  #**  SET  F  EQUAL  TO  ANDERSON-DARLING  DISTANCE  ##* 
F=SUM 
RETURN 
END 
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Appendix  D 

Computer  Listing  of 
Evaluation  CriterTa 


PROGRAM  EVAL 

C  **************************************************  **************** 


c 

* 

* 

c 

* 

WRITTEN  BY 

2LT  JIM 

H.  KEFFER  AFIT/G0R-83D  FOR  MS  THESIS 

* 

c 

* 

DECEMBER  1983 

* 

c 

* 

* 

c 

* 

purpose: 

EVALUATE  THE  MLE  AND  MB  PARAMETER  ESTIMATES  BY: 

* 

c 

* 

<1>  MEAN  SQUARE  ERROR  (MSE) 

* 

c 

* 

(2)  RELATIVE  EFFICIENCY  ( REFF > 

* 

c 

* 

(3)  CRAMER-VON  MISES  DISTANCE  (CVM) 

* 

c 

* 

* 

c 

* 

variables: 

NREPS 

-  NUMBER  OF  REPLICATIONS 

* 

c 

* 

N 

-  SAMPLE  SIZE 

* 

c 

* 

SUM 

-  DUMMY  USED  FOR  SUMS 

* 

c 

* 

SUM1 

-  DUMMY  USED  FOR  SUMS 

* 

c 

* 

X 

-  ARRAY  CONTINING  LAGUERRE  ABCISSA 

* 

c 

* 

U 

-  ARRAY  CONTAINING  LAGUERRE  WEIGHTS 

* 

c 

* 

CVM 

-  MATRIX  CONTAINING  CVM  DISTANCE  FOR  EVERY* 

c 

* 

SAMPLE  AND  ESTIMATOR 

* 

c 

* 

MCVM 

-  ARRAY  CONTAINING  MEANS  OF  CVM  DISTANCE 

* 

c 

* 

FOR  EVERY  ESTIMATOR  TYPE 

* 

c 

* 

SDCVM 

-  ARRAY  CONTAINING  SD  OF  CVM  DISTANCE  FOR 

* 

c 

* 

EVERY  ESTIMATOR  TYPE 

* 

c 

* 

F 

-  VALUE  OF  TRUE  PDF 

* 

c 

* 

TMU 

-  TRUE  VALUE  OF  MU 

* 

c 

* 

TSIG 

-  TRUE  VALUE  OF  SIGMA 

* 

c 

* 

TXI 

-  TRUE  VALUE  OF  XI 

* 

c 

* 

BM,BS, 

BX  -  INTERPOLATIVE  MLE'S  FOR  TMU, TSIG, TXI 

* 

c 

* 

HM , HS , 

HX  -  ITERATIVE  MLE'S  FOR  TMU , TSIG , TXI 

* 

c 

* 

BO* 

-  MSE  OF  ESTIMATES  FROM  INTERPOLATIVE  MLE 

* 

c 

* 

WHERE  0=  INITIAL  OF  MD  ESTIMATOR 

* 

c 

* 

AND  *  ■  M  FOR  MUJ  S  FOR  SIGMA}  X  FOR  XI 

* 

c 

* 

HO* 

-  MSSE  OF  ESTIMATES  FROM  ITERATIVE  MLE'S 

* 

c 

* 

WHERE  0  ■  INITIAL  OF  MD  ESTIMATOR 

* 

c 

* 

AND  *  *  M  FOR  MU}  S  FOR  SIGMA}  X  FOR  XI 

* 

c 

* 

RB/H0* 

-  RELATIVE  EFFICIENCIES  FOR  BP*  AND  H0* 

* 

c 

* 

YI 

-  ADJUSTED  LAGUERRE  ABCISSA  FOR  1=0,8 

* 

c 

* 

ZI 

-  PERCENTILE  PT  FROM  STD  NORMAL  CDF  USING 

* 

c 

* 

A  SET  OF  PARAMETER  ESTIMATES 

* 

c 

* 

MDNOR 

-  IMSL  ROUTINE  WHICH  CALCULATES  THE  CDF 

* 

c 

* 

OF  A  STD  NORMAL  DISTRIBUTION  AT  PT.  X 

* 

c 

* 

* 

c 

* 

i/o  files: 

TAPE7 

-  FORMATTED  INPUT  OF  TRUE  PARAMETERS  AND 

* 

c 

* 

ESTIMATES 

* 

c 

# 

TAPE8 

-  FORMATTED  OUTPUT  OF  MSE'S,  REFF'S  AND 

* 

c 

* 

MCVM, SDCVM 

* 

c 

* 

* 

c 

* 

important: 

IMSL  LIBRARY  MUST  BE  ATTACHED  BEFORE  THE  PROGRAM 

* 

c 

* 

IS  RUN 

.  REVIEW  IMSL  SUBROUTINE  MDNOR 

* 

c 

* 

* 

■t*\i r  .  '  >  c- 
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*  II  I  Ji>X  ■  •  l  I  ~  . 


C 


100 

c 


101 


c 


****************************************************************** 


REAL  SUM <  8  >  , SUM 1(8), COM  <8,1 000 ) , MCVM  <  8 ) , SDCUM  <  8  > 

REAL  X(15),W<15) 

DATA  BM , BS , BX , BKM , BKS , BKX , BCM , PCS , PCX , BAM , PAS , BAX/1 2*0 . 0/ 
DATA  HM, HS,HX,HKM,HKS,HKX,HCM,HCS,HCX, HAM,  I  iAS,HAX/ 12*0.0/ 


DATA  SUM , SUMl/8*<> .  0 , 8*0 . 0/ 

DATA  ( X  <  K )  ,  K=  1 , 1 5 )  /0 . 0933 , 0 .49269 , 1 .21559 , 2 
+7 . 56591 ,10.1 2022 ,13. 13028 ,16. 6540 , 20 .77647 
+38 .53068 , 48 . 02608/ 


. 26994 , 3 . 66762 , 5 . 42533 
,25.62389,31.40751, 


, 


DATA  <  W  ( K  ) ,  K=  1 , 15 )  /<) .  23957 , 0 . 5601 , 0 . 887 , 1 . 22366 , 1 . 5744*  l .  94475 , 
+2.3415,2.77404,3.25564,3.80631 ,4.45847,5.2700! ,6.3595  3  03178, 
+11.52777/ 


PI=3. 1415927 

***  INPUT  TRUE  PARAMETER  ESTIMATES  *** 

REAEi<7, 100)  NREPS, N, TMU, FSIG, TXI 
FORMAT ( I4/13/F15 .6/F15. 6/F15.6/ ) 

*******  BEGIN  DO-LOOP  FOR  NREPS  SAMPLES  ******* 

DO  999  J=l, NREPS 

READ<7, 101 )  BMU , BSIG, BXI , BKMU,BKSIG, BKXI , BCMU , BCSIG , BCXI , 
+BAMU, BASIG»BAXI , HMU , HSIG , HXI ,HKMU , HKSIG , HKXI , HCMU ,HCSIG, 
+HCXI ,HAMU,HA3IG,HAXI 

FORMAT  <  F15 .6/F15 .6/F15 .6//F15 .6/F15 .6/F15 .6//F1 5. 6/Ft 5  *  6/ 
+F15. 6//F15 . 6/F15 . 6/F15 • 6//F15 . 6/F15 . 6/f 1 5.6//F15, 6/F15 . 6/ 
+F15.6//F15.6/F 15.6/F15.6//F15.6/F15.6/F15.6/) 

***  CALCULATE  THE  SQUARED  ERRORS  FDR  ALL  PARAMETERS  *** 

BM=BM+ <  TMU-BMU  >  **2 

BS=BS+<  TSIG-BSIG  > **2 

BX*BX+  <  TXI-BXI ) **2 

BKM=BKM+  <  TMU-BKMU ) **2 


BKS*BKS+  <  TS16- BKSIG ) **2 
BKX=BNX+<TXI-BKXI)**2 
BCM=BCM+ <  TMU-BCMU ) **2 
BCS=BCS+(TSIG-PCSIG)**2 
BCX=BCX+ (TXI- BCXI)** 2 
BAM=BAM+  <  TMU-BAMU ) **2 
BAS=PAS+(TSIG-BASIG) **2 
BAX=BAX+<TXI-BAXI>**2 
HM=HM+  <  TMU-HMU ) #*2 
HS=HS+(TSIG~HSTG>**2 
HX=HX+ ( TXI -HXI ^  **2 
HKM=HKM+  <  TMU-HKMU ) **2 
HKS=HKS+  <  TSIG- HKSIG ) **2 
HKX=»HKX+<  TXI -HKXI  )**2 
HCM=HCM+  <  TMU-  HCMU )  #*2 
HCS-HCS+ <  TSIG-HCSIG ) **2 
HCX=HCX+<TXI -HCXI )**2 
HAM“HAM+ <  TMU-HAMU ) **2 
HAS=HAS+ ( TSIG-HASIG ) **2 
HAX*HAX+<  TXI-HAXI )**2 

***  CALCULATE  THE  CRAMER  VON-MISES  DISTANCE  USING  NUMERICAL  *#* 

***  QUADRATURE  *** 

DO  10  1*1,15 


&?■ 


97 


***  ADD  LOCATION  TO  QUADRATURE  POINTS  *** 

X( I >=X  < I > FTXI 

***  CALCULATE  EOT  CDF  FROM  INTERPOLATION  *** 

IF  <  ( X < I > -BX I >  .LT.  O.OOOOl)  THEN 
21=0.0 
00 TO  11 
END  IF 

Y1=(L0G(X(I )-BXI)-BMU)/BXI 
CALL  HONOR <Y1.,Z1> 

*#*  CALCULATE  EST  CDF  FROM  INTERPOLATION  X  KOL  *** 

IF  <  (X<  I ) -BKXI )  .LT.  0.00001)  THEN 
22=0.0 
GOTO  12 
END  IF 

Y2=(L0G(X(  I ) -BKXI  )-BKMU)/BKSIG 
CALL  MDNOft ( Y2 . 22  > 

***  CALCULATE  EST  CDF  FROM  INTERPOLATION  X  CUM  **# 

IF  (  (X(I)-BCXI)  .LT.  0.00001)  THEN 
Z3=0 . 0 
GOTO  13 
END  IF 

Y3= ( LOG  <  X ( I ) -BCXI ) -BCMIJ  > /BCSIG 
CALL  MDNOR  >,  Y3  ,  Z3 ) 

***  CALCULATE  EST  CDF  FROM  INTERPOLATION  AND  A- D  *** 
IF  (  (X(I)-BAXI)  .LT.  0.00001)  THEN 
Z4=0 . 0 
GOTO  14 
END  IF 

Y4= (LOG<  X ( I ) -BAXI ) -BAMU ) /BASIG 
CALL  MDNOR <Y4,Z4) 

***  CALCULATE  EST  CDF  FROM  ITERATION  *** 

IF  (  (X(I)-HXI)  .LT.  0,00001)  THEN 
Z5=0.0 
GOTO  16 
END  IF" 

Y5= ( LOG ( X ( I ) - HX I ) -HMU ) /HS I G 
CALL  MDNOR <Y5VZ5> 

***  CALCULATE  EST  CDF  FROM  ITERATION  X  KOL  *** 

IF (  < X< I ) -HKXI )  .LT.  0.00001)  THEN 
ZA=0  •  0 
GOTO  17 
ENDIF 

Y6=  ( LOG(X  <  I )  -FIKXI  )  -HKMU)  /HKSIG 
CALL  MDNOR (Y6,Z6) 

***  CALCULATE  EST  CDF  FROM  ITERATION  X  CVM  *** 

IF  <  (X(I)-HCXI)  .LT.  0.00001)  THEN 
Z7=0 , 0 
GOTO  10 
ENDIF 

Y7=<L0G(X< I )-HCXI )-HCMU)/HCSIG 
CALL  MDNOR <Y7,Z7) 

*##  CALCULATE  EST  CDF  FROM  ITERATION  X  A~D  *** 

IF  <  (X(I)-FIAXI)  .LT,  0.00001)  THEN 

98 


c 


19 


C 

C 


9 

C 

999 

C 


Z8=0 . 0 
GOTO  19 
END  IF 

Y8= ( LOG ( X ( I ) -HAXI ) -HAMU  > /HAS! G 
CALL  MDNOR <  Y8  *  Z8 ) 

***  CALCULATE  TRUE  CDF  **# 

Y=(LOG(X< 1 )-rXI)-TMU)/TSIG 
CALL  MDNOR <Y,Z) 

***  EVALUATE  TRUE  PDF  *** 

F  =  (l/<  (X<I)~TXI>*TSIG*SGIRT<2*PI>  )  )*EXP<  <-l  .0/2.0>*Y*» 
***  ADD  TO  SUMS  FOR  EVALUATION  OF  INTEGRAL  *** 

SUM  ( 1 )  =SUM  ( .1 )  -Hd  <  I )  *  <  Z 1  -Z )  **2*F 
SUM(2)=SUM( 2  >  +  W  < I ) * ( Z2-Z ) **2*F 
SUM<3>=SUM<3>+W(I>*(Z3-Z>**2*F 
SUM<4)=SUM(4)+U< I >*(Z4-Z>**2*F 
SUM ( 5 ) =SUM <  5 ) +U ( I ) * ( 75-1 ) **2*F 
SUM ( 6 > =SUH  <  6  >  +M  < I ) *  < Z6-Z ) **2*F 
SUM(7)=SUM(7) FU<I > *< Z7-Z > **2*F 
SUM(8)=SUM(H)+U< I )*<Z8-Z>**2*F 
CONTINUE 

***  CALCULATE  CVM  STATISTIC  FOR  EACH  METHOD  *** 

DO  8  1=1,8 

CVM<I,J)=N*SUM<I> 

CONTINUE 

***  ADD  TO  SUMS  FOR  CVM  *** 

DO  9  1=1,8 

SUM1  <  I )  =SIJM  1<I)  +CVM ( I ,  J) 

CONTINUE 

*******  END  DO-LOOP  FOR  NREPS  SAMPLES  ******* 

CONTINUE 

***  CALCULATE  MSE'S  FOR  EACH  PARAMETER  *** 

BM=BM/NREF‘S 

B3=BS/NREPS 

BX=BX/NREPS 

BKM=BKM/NREPS 

BKS=BKS/NREPS 

BKX=BKX/NREPS 

BCM=BCM/NREPS 

BCS=BCS/NREPS 

BCX=BCX/NREPS 

BAM=BAM/NREPS 

BAS=BAS/NREPS 

BAX=BAX/NREPS 

HM=HM/NREPS 

HS=HS/NREPS 

HX=HX/NREPS 

HKM=HKM/NREPS 

HNS=HKS/NREPS 

HKX=HKX/NREPS 

HCM=HCM/NREPS 

HCS=HCS/NF;EPS 

HCX=HCX/NF<EPS 

HAM=HAM/NREF'S 


99 


HAS=HAS/NREF'S 

HAX=HAX/NREPS 

C  ***  CALCULATE  THE  RELATIVE  EFFICIENCIES  **# 

RBM=BM/BM 

RBS-BS/B8 

RBX-BX/BX 

RBKM-BM/BKM 

RBKS=BS/BKS 

RBKX=BX/BKX 

RBCM=Bh/BCM 

RBCS=BS/BCS 

RBCX-BX/BCX 

RBAM-BM/BAM 

RBAS=BS/BAS 

RBAX*BX/BAX 

RHM=BM/HM 

RHS-BS/HS 

RHX-BX/HX 

RHKM=BM/HKM 

RHNS=BS/HKS 

RHKX=BX/HKX 

RHCM=BM/HCM 

RHCS=BS/HCS 

RHCX=BX/J1CX 

RHAM=BM/HAM 

RHAS=BS/HAS 

RHAX=BX/HAX 

C  ***  CALCULATE  MEAN  CVM  *** 

DO  35  1=1,8 

MCVM( I )=SUM1 < I ) /NREPS 
35  CONTINUE 

C  ***  CALCULATE  THE  STD  DEV  OF  CVM  STATISTICS  *** 

DO  37  1=1,8 
SUM  < I ) =0 • 0 
37  CONTINUE 

DO  45  1=1,8 

DO  46  L» If NREPS 

SUM(  I  )=SUM<  I )  KCVM(  I  ,L)-MCVM<  I )  >**2 
46  CONTINUE 
45  CONTINUE 

DO  55  1=1,8 

SDCVM  ( I  >  =  <  SUM  <  I )  /NREPS  >**().  5 
55  CONTINUE 

C  ***  READ  IN  THE  DIVERGENCE  COUNTERS  FOR  THE  MD  METHODS  *** 
READ  (7,102)  NKB ,  NCB ,  NAD ,  NKH ,  NCI-I ,  NAH 
102  FORMAT (14, 14, 14, 1. 4, 14, 14) 

C  ##*  WRITE  ALL  RESULTS  TO  FILE  *** 

UR1TE(8t ' (21X, "SAMPLE  SIZE  *,I5)')  N 

WRITE(8f  '  (21X,  "TRUE  MU  "fFS^)')  TMIJ 

URITE(3f/(21X, "TRUE  SIGMA  ■,F5.2)/>  TSIG 

WRITE ( 0, ' ( 21X , *  TRUE  XT  *, F5.2,/)')  TXI 

WRITE (8, ' (12X, •MU', 13X, ■ SIG* , 14X , *  X T * ,5X , ■ DIV , / ) ' ) 

WRITE (8, ' (20X, ‘MEAN  SQUARE  ERRORS * , /) ' ) 
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WRITE(8» ' < • INT  :",F11.£,F16.£,F1A.£>'>  BM,BS,BX 

WRITE <8» '  <  "KOL.  :  "  ,F11 . A»F1£.A,F1£.£, IA> '  )  BKM ,  BKG » BKX ,  NKB 

WRITE  < 8 1 ' ( *  CVM  t  "  »F11.A»F1A«A»F16»£fI£)' )  BCM,BCSrBCXf NCB 

WRITE  (8, '  <  "A  D  t  "  ?F11 .£. FI  A. A,F1A. £»!£,/> '  )  BAM ?  BAG r  BAX  r  NAB 

WRITE <8»  '  (  *  ITR  J  "  ,FU . A , FI A . A , F t A . A > '  )  H;1,HS,MX 

WRITE < 0» ' < "KOL  J " ,F 11. A..F1A.A,  F1A, £,!£>' )  HKM , HKS , HKX f NKH 

WRITE* 8, ' * "CVM  J " ,F11.£,F1£.£,F1£.£,I£) ' )  HCH  f HCG « HCX , NCH 

WRITE ( 8,' ( "A-D  J" ,Fli.£,Fl£.£r F1A. £,!£,/) ' )  HAM r HAS r HAX t NAH 

WRITE < 8 » ' < 18X, "RELATIVE  EFFICIENCIES" f/ ) ' ) 

WRITE < 8f ' < " INT  J " ,Fll.A,Fl£.AfFl£.£) ' )  RBM,RBBrRBX 
WRITE ( 8 » ' ( " KOL  J " , Fll . A , FI 6 . A , F1A . A) ' >  RBKM , RBKS > RPKX 
WRITE(8» ' < "CVM  i"?F11.6fFl£»£,F16.A>'>  RBCM , RDCS , RBCX 
WRITE<8» ' ( "A-D  J " »Fll»6fFlA.A»FlA.A»/) ' )  RBAM f RBAS r RBAX 
WRITE(8f ' < "ITR  *"»F11*A»F1A.A»F1A»A)/)  RHM»RHS»RHX 
WRITE < 8» ' < "KOL  : " ,F1 1 . A , FI A . A , F 1 A . A ) ' )  RHKM , RHKS , RHKX 
WR I TE  ( 8 » '  (  "CVM  ",  "  ,F11 .  A  ,  FI  A .  A  » FI  A .  A  )  '  )  RHCH ,  RHCS ,  RHCX 
WRITE  <8r '  (  ‘  A-D  J"»Fll.A»FlA.AfFlA. £»/)')  RHAM ,  RHAS ,  RIHAX 
WRITE ( 8» ' ( 21X f " CVM  STATISTICS" ,/) ' ) 

WRITE (8, ' < 18X, "MCVM" ,11X, "SDCVM" ,/)') 

WRITE*  8»  #  <  "  INT  : " ,2X,F1£.A,F1£.£> ' >  MCVM* 1 ) ,SDCVM< 1 > 
WRITE(8» '  <  "KOL  :  "  ,2X.F1£.£,F1£.£>  '  )  MCVM(2)  ,SIJCVM(2) 
WRITE<8, ' < "CVM  J " ,2X,F1A.£,F1A.£> ' )  MCVM(T) , SDCVM<3> 

WRITE <8» ' < "A~D  : " ,2X,F1A.A,F1A.A,/) '  )  MCVM ( 4 > , SDCVM< 4 ) 

WRITE ( 8  *  '  <  " ITR  J " f2X,FlA.A,FlA.A) ' )  MCVM<5> ,SDCVM<5> 

WRITE (8 »  '  < "KOL  J " ,2X,F1A. A,F1A.A) ' >  MCVM *  6 )  » SDCVM ( A > 

WRITE <8» ' < "CVM  :",2X,F1A.A,F1A.A>'>  MCVM < 7) , SDCVM* 7) 
WRITE(8, ' ( "A-D  : " ,2X,F1£.6,F1£*£> ' >  MCVM * 8> , SDCVM* B > 

STOP 

END 
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